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(57' Abstract 

Th- present invention relates to chimeric transcriptional 
activators'. The chimeric transcription activator protein contams 
at leas: cr.e composite transcription activation domain. ( TAD , 
and" leas' on, DBD. The compos:* TAD comprises a continuous 
Sv w£e region canning two or more ^"^^^ 
regions (also referred to as "act.vaLon tags al 
of whxh do not occur together m the same gene product in 
nature" In certain of these embodiments the chunenc protetr. 
™ses a hgand binding domam tor allcsanc activation. e* 
receptor domain capable of binding a cell P*™"^^ 
a, tetivclmc or an analog thereof or a steroic such as RL 486 or 
ecJvsc^e. for example. In such cases, the c^enc^npuor 
activator selectively activates crar.scr^tion of a target gen m 
-V -reserce of the respective ligand In other em.oodiments J* 
ch-m-rc inscription acuvator is capable cf activating target gene 
expression consntutivelv. i.e.. wuncui the need for tne presence of 
any such agaric. 
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Background of the Invention 

A large number of bioioaical and clinical protocols, among others, gene therapy, 
production of biological matenals. and biological research, depend on the ability to then 
specific and high-level expression of genes encoding RNAs or proteins of therapeutic, 
commercial or experimental value. A variety of express.on systems have been developed, 
including regulated express.on systems, involving allostenc on switches triggered o> 
tnracvcline RU486 ana ecdvsone. as well as d.rnenzat.on based on-off switches triggered 
bv FK10P FK-CsA. rapamvc.n and analogs thereof. See e.g. Clackson, 'Controlling 
mammalian ecne expression with small molecules" Current Opin.on in Chem.cal B.ology 
1997 1-10-218. Still, achieving a sufficiently high level of expression for clinical or 
othe ? mi htv in ceneticaiiv engineered cells in various contexts, including within whole 
or 2 aru<ms"has oner, been a limiting problem. Various approaches for addressing tnis 
problem .nclud.nc the search for stronger transcr.pt.onal promoters or higher transfecuon 
efficiencies have in mam cases not met with success. Meanwhile, in various lines of 
research with transcription factors, promising results in transient transfecuon models have 
not been borne out with chromosomally integrated reporter gene constructs. Furthermore, 
overexpress.on of transcription factors is commonly associated with toxicity to the host 
cell Despite those prececer.ts. this invention taxes a novel approach to the challenge of 
opt,m,zine gene expression through new uses of. and new designs for. transcription factor 
prof.ns which are expressed within the engineered cells containing the target gene. The 
invention provides improved methods and materials for achieving high-level expression of 
a target gene m genetically eng.neered cells, including genetically engineered cells wit.™ 
whole organisms 



Summary of the Invention 

in the course of our research on transcription activation, we nave quite 
unexpectedly discovered an important phenomenon which may explain the absence ct 
greater proeress in the discover, and-or optimization of more powerful transcription 
'activating domains. Briefly, we have made the unexpected discovery that in conventional 
transient transfecuon experiments of tne sort typically used in this field to stua> 
transcription activation, as the number or anparen, potency of activation domains 
increases the observed level of transcription increases to a maximum and then levels on 
or decreases ,n a phenomenon termed -squelching-. However, when analogous 
experiments are conducted using stably incorporated reporter genes in place o: trans.entlv 
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transfected reponer genes, squelching is avoided, and instead, an unimpeded structure- 
activitv relationship can be observed for various transcription activate doma.ns and 
mo dif.cauons thereof. With the banners othenv.se .mposed by squelchmg now removed, 
we have made significant advances in the des.gn and implementation of transcr.pt.on 

5 activation doma.ns in ch.mer.c proteins and transcription systems. 

Th.s invention encompasses nucle.c acid constructs encoding vanous chimeric 
ascription activator prote.ns .also referred to below as "composite activators", and in 
,ome embodiments auxiliary chimeric prote.ns. as well as related materials ana methods 
as disclosed m greater detail below, A number of Ulustrauve embodiments are hignhghted 

10 beiow 

Po- example, a number of embodiments of the invention involve a nucle.c acid 
encoding a ch.mer.c transcription activator prote.n wh.ch activates the transcription of a 
ce ne to which the chimeric transcription activator protein is targeted. Targeting may be 
eff-ctcd via one or more DNA-b.nding domains f'DBDs") which b.nd to a DNA sequence 
„ to wh.ch a target gene is operatively linked. The DBD(s) may be present w.thm the 
chimenc transcription activator protein or may be provided by an auxiliary- chimeric 
protein as disclosed in extensive detail below. 

Ia certain embodiments of the invention the chimeric transcription activator prote.n 
contains at least one composite transcr.pt.on activation domain ("TAD") and at least one 
DBD The composite TAD comprises a continuous polypeptide region containing two or 
n.ore component poivpept.de regions .also referred to below as "activation tags"), at .east 
two of wh.ch do no; occur together in the same gene product in nature. In certain en tnese 
embodiments the chimeric prote.n comprises a iigand binding domain for alloster.c 
activation, e.e. a receptor domain capable of binding a cell permeant hgana such as 
^5 tetracycline or an anaioc thereof or a steroid such as RLU86 or ecdysone. .or example In 
sucn cases, the chimeric transcription activator selectively activates transcnpt.on of a 
lareet gene ,n the presence of the respective ligand. In other embodiments me chamenc 
transcription activator is capable of activating target gene expression const.tut.vdy. i.e.. 
without the need for the presence of any such iigand. 

In other embodiments, the chimenc transcription activator prote.n conta.ns at least 
one composite TAD comprising three or more component polypeptide reg.ons. at least two 
of which do not occur together in the same gene product in nature, and at .east one 
additional domam which ,s heterologous with respect to at least one o. the «mpcnem 
polvpept.de ree.ons of the composite TAD. The add.uonal domain;*! may inciuce a DBD 
or a receptor domam for an ol.gomer.z.ng iigand. OHgomertzmg iigands are multivalent, 
nre.-erablv cell permeant. compounds, generally having a molecular weight beiow about 5 
kD* and -referabiv below about 2 kD. which mediate the formation of complexes w.th 
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nrotems coniainuiB receptor domains to which the hgands binds. Non-limiting, illustrative 
examples of oheomenzmg Hgands include FK1012 (with respect to proteins containing 
FKBP domains), coumermycin ^with respect to prote.ns containing DNA Gyrase 
domains), fuiisponn (with respect to prote.ns containing cyclophii.n and FkBP domains. 
5 respectively) and rapamycin (with respect to proteins containing FkBP and FRAP 
domains, respectively). 

In still other embodiments, the chimeric trar.scnpt.on activator protein contains at 
least one composite TAD comprising a continuous polypeptide region containing two or 
more component polvoepude regions, at least two of which do not occur together ,n the 
,o same E ene product in nature, or at least not in the same order, arrangement or numoer as 
found in the chimeric protem of this invention, and at least one additional domain wh.cn is 
h-terologous with respect to a, least one of the component polypeptide reg.ons ot the 
composite TAD In some of these embodiments, the components are all of human origin 
In other of these embodiments, at least one of the heterologous domains comprises a 
,s domain derived from an .mmunophiiin. cyclophii.n, caicineur.n. FRAP or DNA gyrase 
domain; a domain derived rrom a receptor for tetracycline or ecdysonc or another steroid, 
or a composite DNA binding domain. 

In still otner embodiments, the chimeric transcription activator protein contains at 
leas' one transcription activation domain, which may be a previously known transcription, 
activation domain such as VP16. a novel truncated p65-derived activation domain 
disclosed in detail below or a composite TAD as described herein, and at least one 
bundlme domain A bundling domain is a domain permitting assembly of complexes ot 
rwo or more prote.ns each of which comprises a copy of the bundling domain or a cose 
variant thereof. Non-limiting examples of bundling domains include domains derived rrorr. 
- c5? or the E cob. lac repressor as well as various leucine zipper domains Bundling 
domains are distinguished from heterool.gomenzing proteins such as the 
FKBP/calc.neurm pair, the FKBP/FRAP pair, the cyclophuir, calcineurm pair, the 
RXRTBP pair, for example, which form heterooiigomers of proteins containing verj 
different bindinc domains In contrast, bundling domains permit oligomerizatior. between 
jo proteins containing the same bundling domains or very similar variants thereof and uo so 
without the need for ligand-mediation. 

In various embodiments, one or more of the DBD. receptor domain, ol.gomenzing 
i.sand bindine comain or other additional heterologous domain is heterologous w.tn 
r-sne- to at least one of the component polypeptide regions of the composite transcription 
j< activation domain. In other embodiments, one or more of such additional heteroiogous 
domains is heteroloeous with respect to at least two of the component polypeptide regions 
of the composite transcrirt.cn activation domain. In one embodiment, the composite 
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activator protein includes at least two different activation tags from heterologous sources, 
e E acnvat.on taes which are derived from two or more different transection factors or 
cJ-ac-vators and which do no: naturally occur together in the same protein While no: 
wishing to be bound bv any particular mechanistic hypothesis, two or more ot the 
< acvation ta E s of a composite activator, more preferably two which are derived from 
disparate proteins, mav recruit TAFs which are not required for basal activation of a gene 
Through the activation tags, the chimeric activator ma> recapitulate the interaction of 
several otherwise discrete factors with the polymerase complex, or prov.de novel contacts 
with tha. complex, which interactions may recruit and/or stabilize the formation of 
i(. activator-dependent transcriptional complexes. 

In this context, it has also been discovered that a variety of activation tags can be 
repeated multiple times in the same chimeric protein, or multiply associated therewith. 
w,th a concomitant increase in transcriptional activation. The repetitive activation, tag 
embodiment can be combined with other activation tags. e.g.. from the same or 
! ; heterologous sources as the repeated activation tag(S). 

Another asnect of the invention relates to the identification of novel activation tags 
In mis recard. an activation tag has been identified in the NF-kB transcription factor 
subun.t P 65. The so-called -alanine-proline rich" or "AP" activation tag of P 65 extencs 
from about amino acids 361 to about ammo acd 450 of that protein. See SEQ ID No. 2. 
^0 Similar AP activation tacs are also present in, e.g.. the p53 and CTF proteins. The present 
invention also contemplates fragments of P 65 (or homologous sequences tnereto. e.g.. 
from p53 or CTF. which are about 75. 60. 50. 30 or even 20 amino acd residues in length, 
in other embodiments, the AP activation tag has an am.no acid sequence at least 95%. 
90%. 80% or 70 c o identical to the AP activation tag of SEQ ID No. Z. 

The subject chimeric activators can be used to drive high levels of transcription 
from naturallv-occumne. or otherxv,se genomicalh -integrated genes The chimeric 
activators of the invention are particularly useful for activating transcription o: integrated 
single copv genes, which in the past have no, successfully transacted at appreciate 
levels In preferred embodiments the level of expression of a chjomosoma.h -integrates 
,0 tareet cene achieved with me novel and- or composite transcription act.vat.on dcr.a.ns 
disclosed herein :s at least two-fold, preferably three-fold, more preferably t,ve-,o.d. and 
optimallv ten-fold or better greater than expression levels achieved using the YPIO comam 
as the transcript.cn activation domain in the analgous expression system, as measurea ,n a 
scientifically valid comparison. 

This invention also encompasses nucleic acid compositions comprising a firs, 
ruc'e.c acd encodme a chimeric inscription activator protein wh.ch contains one or 
more heand-bmdme doma.ns for an ol.gomenztng hgand such as described above ana a 
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second nucieic ac.d encoding a chimeric DNA-bind.ng protein which composes a: least 
one DNA-bindin C domain and at least one hgand binding domain for a cell permeant 
l 12 and Thus, the chimeric proteins encoded by these nucle.c acids eacn contain one or 
more iigand-bindme domains for binding to an ohgomerizmg ligand which mediates the 
dimerization or h.cher-order hetero-ohgomenzat.on of the chimeric proteins. In some 
embodiments, one or both of the chimeric proteins contains two or more hgand-b.dmg 
domains The nucleic acid composition may further compr.se a target gene construct 
comprising a target gene operatively linked to a transcriptional regulatory element wn.ch 
includes a DNA sequence to which the chimeric DNA-binding protein binds. 

This invention also encompasses nucleic acid compositions comprising a first 
nucieic acid encodinc a chimeric transcription activator protein which contains one or 
more composite TADs and one or more DBDs. such as described above, and a second 
nucieic aod comprisinc a target gene construct comprising a target gene operatively linked 
to a transcriptional regulator, element which includes a DNA sequence to which the tne 
5 chimeric transcription activator protein binds. 

A nucleic acid encoding a chimeric protein of this invention may be operably 
linked to a transcriptional regulatory element permitting expression of the chimeric protein 
in cells. The various nucie.c acids may be provided in DNA vectors as disclosed below. 

The invention also encompasses methods for engineering cells for the regulated or 
•0 constitutive expression of a heterologous target gene. That method involves introducing 
nucleic acds o: nucleic acid compositions of this invention into the cells using methods 
and materials permitting uptake by the cells of the nucleic acds. In embodiments oi 
particular interest, me tarcet gene is integrated within the chromosomes ot the host cells 
In some cases the transfected ceils are selected and separately recovered trom ncn- 
- transfected cells in certain embodiments the ceils are grown in culture. In some 
embodiments the cells are engineered m vivo .within a whole organism), while in other 
embodiments cells are transfected in vitro and the transfected cells and/or their orogeny 
are subsequently introduced into whole organisms. In the latter case, the engineered cells 
may be encapsulated prior to introduction into the organism. 
-,0 Engineered cell; which contain one or more nucieic acids or nucieic acid 

compositions of this invention are also encompassed. Again, engineered ceils in which the 
target gene is stably integrated w.thm the cell's chromosomes are of particular interest 

This invention fanner encompasses metnods for effecting expression of a target 
cene which comprises maintaining genetically engineered host cells as described herein 
under conditions suitable for gene expression. In regulated expression embodiments, this 
wili involve contacting the cells, m vitro or m v.vo. with the hgand or oligoir.cnr.mg agent 



PCTYUS9"/15:i9 

WO 99.10508 

- 6 - 

which bines to one or both of ihe chimeric proteins. 

Applications of this invention include transcnpt.on of genes, constitutively or in a 
drua-dependent manner in v.tro. e.g. for the production of a desired protein which may be 
separated recovered, for achieving higher levels of expression in transcription based 
assavs uncludinc rwo-hvbrid assays), and for the regulated expression of recu.red v.ral 
genes ir producer celis lines used for production of recombinant viruses (e.g. for the 
Regulated expression of AAV rep and/or ca P genes in host cells used for the production of 
recombinant AAV, Other applications include in vivo applications sucn as . the 
constitutive or reflated expression of a target gene of interest in an animal model .e.g. for 
^search or veter.narv purposes, as well as for the constitutive or regulated express.on of a 
target eene of interest in a numan subject, e.g. in the case of gene therapy. In the case ol 
human gene thera P > . it will often be preferred that the components of the chimeric proteins 
be of human origin and or mat the engineered cells be encapsulated. 

Other features ana advantages of the invention will be apparent from the following 
detailed description and claims. 

Brief Description of ihe Figures 

Figure !A shows the level of reporter gene expression (in EAP Units) from the 
Plasrmd 5XGAL--1L2-SEAP transiently transfected into HT1080 cells cotransfected with 
) various amounts of GAH- F 65 . diamonds, or GAL4-VP16 (squares) express.on constructs 
(Activator DNA'i Mean values of SEAP activity secreted into the medium are snown ( - 
S.D X 

Figure IB shows the level of reporter gene express.on un EAP Units, from a cione 
of HT1080 cells .KTIOSOB^ having an integrated plasmid cotransfected with various 
5 amounts of GAL-4-r65 (diamonds) or GAL4-VP16 .squares, expression constructs Mean 
values of SEAP activity secreted into the medium are shown (*/- S.D.V 

Figure 1C snows the level of reporter gene expression (in EAP Units', m a cool of 
hundreds of independent HT1080 clones carrying an integrated piasm.d P LH-5xGAL4- 
IL~-SEM» cotransfected with various amounts of GAL4-p65 .diamonds) or GAL-A PI6 
SO (squares, expression constructs Mean values of SEAP activity secreted into the medium 
are shown (->-/- S.D.). 

Figure 2 snows the transcriptional activity of the integrated SEAP gene rrom pLH- 
«xGAL4-IL2-SEAP plasmid in the presence squares) or absence t.circies . of co- 
transfected 5xGAL4-IL2-hGH reporter plasmid. as well as the transcriptional activity ct 
3 s the transient transfected 5xGAL4-IL2-hGH reporter plasmid .mangles, in HT108C ceils 
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cotransfected with vanous amounts of GALl-p65 express.on construct (Activator). Mean 
values of SEAP activity and hGH protein secreted into the medium are shown <— S.D. >. 

Figure 3A is a diagrammatic representation of rapamycir.-mduced d.merization o: 
rwo fusion protems. one containing the GAL 4 DNA binding domain fusee tc FKBP12. 
5 and the other containing the p65. activation domain fused to FRB. thereby leading to 
stimulation of target gene expression. 

Figure 3B is a diagrammatic representation of rapamycin-induced dimension of 
two fusion proteins, one "containing a GAL4 DNA binding domain fused to the three 
copies of FKBP12. and the other containing the p65. activation domain fused to FRB. 
0 thereby leading to association of three p65 activation domains with each GAM mononer 
in the presence of rapamycm 

Figure 3C is a diagrammatic representation of rapamycin-induced dimenzation ot 
two fus,on proteins, one containing a G ALU DNA binding domain fused lo one copy of 
FKBP12. and the ether containing the "bundled" fusion protein. RLS. which contains the 
; 5 tetramenzation domain of lactose repressor between FRB and the P 65 activation domain, 
thereby leading to association of four activation domains with each FK3P in a rapamyem- 
aependent manner 

Figure 4 A reoresents the level of reporter gene express.on (SEAP Units) of a stabl> 
integrated P LH-5xGAL4-IL2-SEAP plasm.d in HT1080B cells cotransfectec with a 
20 plasmid encoding a GAL4DNA binding domain linked to 1. 2. 3. or 4 FKBP:2 proteins 
(GF1 GF2. GF3. and GF4. respectively) and a either a plasmid encoding FR3 rased to 
n 6 5 activation domain (RS. or a plasmid encoding FRB fused to the E. coh tetramenzation 
domain and p65 activation domain (RSL. in the presence of 10 nM rapamvem. Mean 
values of SE.AP activity secreted into the medium following addition of 10 nM rapamycm 
25 are shown (+- S.D ). ADDBD ratio indicates the ratio between the amount of plasmid 
encoding a encoding an activation domain (AD) and a plasm.d encoding a GAL- DNA 
binding domain <DBD1 

Figure JB represents the level of reporter gene expression (SEAP Units, of a 
transiently transfected 5xGAL4-IL2-SEAP plasm.d in HT1080 cells cotransfected w.tn a 
30 plasmid encoding a GAL4DNA binding domain linked to 1. 2. 3. or 4 FRBP '. 2 protems 
(GF1. GF2. GF3. and GF4. respectively! and a either a plasmid encoding FRB msec to 
n65 activation doma.r. (RS) or a plasm.d encoding FRB fused to the E. coh tetramenzation 
domain and p65 activation doma.n (RSL: m the presence of 10 nM rapamycm Mean 
values of SEAP activity secreted into the medium following addition of 10 nM rapamycm 
35 3re s hown (-<- S.D ). AD DBD ratio indicates the ratio between the amount of p.asm.d 
encoding a encoding an activation domain (AD) and a plasm.d encoding a GAL- DNA 
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binding domain iDBD). 

Figure 5 shows the level of reponer gene expression (SEAP Units) of an integrated 
SEAP gene in HT10S0B cells transiently transfected with varying amounts of GAL4-p65 
plasmid in the absence .square, or presence of either 120 nM trichostattn A (diamond) 1 
mM sodium butyrate .circle ,. Median values of SEAP activity secreted into me medium 
are shown (-.*- S.Dj. 

F.gure 6 shows the level of reporter gene expression (SE.AP Units) of an integrated 
SEAP gene in HT1080B cells transiently transfected with expression vectors encoding 
GAL4 fus.on proteins having an activation domain from various transcr.puon factors. 

Figure 7 snows the level of reporter gene expression . SEAP Units') of an integrated 
SEAP gene in H71080B cells transiently transfected with expression vectors encoding 
GAL4 fus.on proteins having no activation domain (G only i; 1. 2. or 4 activation domains 
from VP16 (GYP16.V.. G VP 1 6X2. and GVP16X3, respectively). 1. 2. 3. or - activauon 
domains from P 65 .GpoSXL Cp65X2. G P 65X3, and Gp65X4. respectively): o: a 
combination of an activation domain from VP 16 and an activation domain from pt» 
(Gp65 - VP 16) 

F.gure 8A shows the level of reporter gene expression (SEAP Units) of an 
integrated" SEAP gene in HT1080B cells transiently transfected with expression vectors 
encoding GAL 4 fusion proteins having 2. 4. 8. or 12 copies of the V8 (GVSX2. GV8X4. 
GV8X8. and GVSX12. respectively). 

F.gure 8B shows the level of reporter gene expression .SEAP Units, of an 
integrated SEAP gene in HT1080B cells transiently transfected with expression vectors 
encoding GAL- fusion proteins having 1. 2. 3. 4. 5. or 6 copies of the Vc (GVCX2. 
GVCX3. GVCX4. GVCX5. and GVCXe. respectively . 

F.gure SC shoves the level of reporter gene expression .SEAP Units, of an 
integrated SEAP gene ,n HT1080B cells transiently transfected with expression vectors 
encoding GAL 4 fus.on proteins having 8 VS copies (GVSXS), 5 Vc copies (GVCXf .. S 
copies of VS and 5 cop.es of Vc (GVSXS - GVCX5). 8 copies of VS and 5 cop.es of Vc :n 
the reverse order t GVCXf - GVSXS or VP 16 iGVPi6V 

F.gure 9A shows the level of reporter gene expression .SEAP Units, o: an 
integrated" SEAP gene in HT1080B cells transiently transfected withexpression vectors 
encoding GAL- fusion proteins having ammo ac.ds 450-550 of p65 (Opof I-5C-550)). 
2. or 4 cop.es ofam.no acids 361-450 of p65: or 1. 2. or 4 copies of an activation domain 
from Spl (is this correct"). 

F.gure 03 shows the level of reponer gene expression .SEAP Units, of an 
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integrated SEAP gene in HT1080B cells transiently transfected with expression vectors 
encoding GAL4 fusion proteins having an Spl activation domain alone (GSi or together 
with one or two copies of the AP domain. 

Figure 10A shows the level of reporter gene expression (SEAP Units, of an 

5 integrated" SEAP gene in HT1080B cells transiently transfected with expression vectors 
encoding GAL4 fusion proteins various activation domains or combinations thereof 

Figure 10B shows the level of reporter gene expression (SEAP Units) of an 
integrated" SEAP gene in HT1080B cells transiently transfected with expression vectors 
encoding GAL 4 fusion proteins various activation domains or combinations thereof. 

10 Figure 1 : is a schematic representation of a composite activator. 
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Detailed Description of the Invention 
1. General 

Protein-encod.ng genes in eukaryotes are transcribed by RNA polymerase II (pol 
II). a multisubum: enzyme that is brought to an appropriate gene promoter (pol U 
promoter) through tne assembly of a pre-miuation complex comprising a number of 
general transcription factors The multisubunit protein complex TF1ID is required for 
transcription by most, if not ali. promoters targeted by pol II. Whereas the TATA-box 
binding protein (TBP) of TFIID is sufficient for basal transcription, pol II transcription is 
also regulated by gene-specific activator proteins. Activator-dependent transcnpnon 
requires, inter alia. TBP-associated proteins (TAFs) and other transcriptional cofactors. 
One of the important concepts to emerge from studies of eukarvotic gene expression is thai 
activators of pol Il-dependent transcription are composed of functional modules whose 
abilities to bind to subumts of the ultimate pol II complexes regulates transcriptional 
activity of a nearby gene 

The present invention pertains to nucleic acid molecules and proteins which can be 
used to regulate the expression of genes in eukaryotic ceils. 

One aspect of the present invention relates to chimeric transcriptional activators 
("composite activators") which are derived to include a multiplicity of heterologous 
"activation tags" (further defined infra,, e.g.. polypept.de sequences capable of affecting 
transcriptional activation, as for example, affecting the assembly or stability of an active 
polvmerase complex It has been discovered that activation tags nom disparate prote.ns 
can' be combined in a s.ngle polypeptide, or artificially recruited by a complex to a single 
recognition element and retain the ability to synergisucalh activate transcription. In its 
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s.mpiest elaboration, the chimeric activator includes at least two different activation tags 
from heterologous sources, e.g.. activation tags which are derived from two or more 
different transcription factors or co-activators and which do not naturally occur together ,n 
the same protein. As described further in the appended examples, constructs ot this type 
are observed to form potent transcnpt.onal activators with a variety of uses Accordingly, 
the invention provides composite transcriptional activators which are derived with 
activation tags from at least two separate proteins. 

in this context, it has also been discovered that a variety of activation tags can be 
repeated multiple times in the chimenc protein, or multiply associated therewith, with a 
concomitant increase in transcriptional activation. This result was unexpected in light ot. 
as an example, the belief that squelching observed in the systems of the prior art was a 
result of a rate limiting step involving the availability of general transcription factors, e.g.. 
those of the TFIID complex Accordingly, another type of composite activator of the 
present invention is one wherein multiple copies of an activation tag are repeated, e.g.. 
from 2 to 20 times, m a given composite activator. The repetitive activation tag 
embodiment can be combined with other activation tags. e.g.. from the same or 
heterologous sources as the repeated activation tag(si. As described with greater detail 
herein, in either of the above embodiments, the activation tag can itself induce activator 
dependent transcription. Alternatively, the activation tag may be one which alone is 
3 insufficient to inauce activator-dependent transcription, but rather produces a synergistic 
effect when provided with a second activation tag which itself has some ability to induce 
act.vator-dependent transcription. The composite activators of the present invention may 
be generated to include DNA binding domains, ligand binding domains (LBDsi and or 
oligomsrization domains (ODs). 

Another aspect of the invention relates to the identification of novel activation tags 
In this recard. an activation tag has been identified in the NF-kB transcription factor 
subun.t p65. The so-called -alanine proline rich" or "AP" act.vat.cn tag of p65 extends 
from about ammo acids 361 to about ammo acid 450 of that protein. See SEQ ID No. 2 
Similar AP activation tags are also present in. e.g.. the p53 and CTr proteins As 
0 described in the Examples, the presence of one or several copies of the .AP domam alone 
in a protein does not prov.de the ability to induce activator-dependent transcr.puona. 
activation. However, when linked to activation tags svhich are themselves capable or 
inducing some level of activator-dependent transcription, e.g.. another portion of p65 or 
VP 16. die AP activation tag synergizes with the second activation domain to induce an 
-5 increase in the level of activated transcription The AP activation tag can be usea. for 
example, to construct chimenc transcription factors, to generate amg screening assays, or 
as a competitive inhibitor of p65 or other transcription factors which utilize the AP 
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acuvation tag to form transcnptional complexes. 

Still another aspect of the present invention relates to the observation that, in 
contrast to eoisorr.ai genes, the level of transcription of an integrated gene is directly 
related to the number of activation tags that can be delivered to the gene. While no: 
wishing to be bound by any particular mechanistic theory, the inhibition of transcription of 
an episomal gene in the presence of multiple activation unit might result from the 
sequestration by the activator (unbound to DNA) of either (i) an adaptor component 
necessary to bridge the activator with the basal machinery, and/or (in a basal component 
In contrast, transcription of an integrated gene appears to be enhanced by delivery of 
additional activation units to the promoter sequence(s) of the gene. Thus, the subject 
chimeric activators can be used to drive high levels of transcription from naturally - 
occumng. or otherwise genomically-mtegrated genes. The chimeric activators of the 
invention are particular^ useful for activating transcription of integrated single copy 
2 enes. which in the past have not successfully transactivated at appreciable levels. 
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II. Definitions 

For convenience, the meaning of certain terms and phrases employed in the 
specification, examples, anc appended claims are provided below. 

As used herein, the term "gene" or -'recombinant gene" refers to a nucleic acid 
molecule comprising an open reading frame and including at least one exor. and 
(optionally) an intron sequence The term, "intron" refers to a DNA sequence present in a 
given gene which is not translated into protein and is generally found between exons. 

As used herein, the term nucleic acid" refers to polynucleotides such as 
deoxyribonucleic acid ( DN A ). and, where appropriate, ribonucleic acid iRNA). The term 
should also be -understood to include, as equivalents, derivatives, variants and analogs of 
either RNA or DNA made from nucieoude analogs, and. as applicable to the emcod-.ment 
being described, single (sense or antisensei and double-stranded polynucleotides 

Tne term "operabiy linked- when referring to a transcnpuonal regulator, sequence 
and a coding sequence is intended to mean that the regulatory sequence is associated with 
the coding sequence in such a mariner as to facilitate transcription of the coding sequence 
in an activator-dependent fashion. 

Tne terms •'protein", "polypeptide' and 'peptide" are used interchangeably herein 
when referring to a gene product, e.g.. as may be encoded by a coding sequence 

"Transcriptional regulatory sequence", also termed herein "regulator, element", 
••regulatory sequence" or "regulator} element", are generic terms used throughout the 
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spccificauon to refer to DNA sequences, such as initiation signals, enhancers, ana 
promoters which induce or control transcription of protein coding sequences wm, wn.cn 
thev are operabN linked. The term "enhancer-, aiso referred to herein as "ennancer 
element", is intended to include regulator elements capable of increasing, stimulating, or 
< enhancing transenpuon from a basic promoter. The term, -silencer-, aiso reterred to 
herein as "silencer element" is intended to include regulator, elements capaoie ot 
decreasing, inhibiting, or repressing transcription from a basic promoter Regulatory 
elements can also be present in genes other than in 5" flanking sequences. Thus. ,t is 
possible that regulatory elements of a gene are located in inrrons. exons. coding regions. 
10 and 3' flanking sequences. 

The terms -basic promoter'" or ■•minimal promoter", as used herein, are intended to 
refer to the minima! transcriptional regulatory sequence that is capable of initiating 
transcnpt.on of a selected DNA sequence to which it is operabiy linked. This term is 
intended to represent a promoter element providing basal transcription. A bas.c promoter 
,< frequently consists of a TATA box or TATA-like box and is bound by an RNA 
polymerase and by numerous transcnpt.on factors, such as GTFs and TATA box B.noing 
Proteins (TBPs). 

The terms "basic promoter" and "regulatory element" further encompass "tissue 
specific" promoters and reculatory elements, i.e.. promoters and regulator, elements 
^0 which effect exoress.on of the selected DNA sequence preferentially in spec.f.c cells < e.g.. 
cells of a specf.c tissue I Gene expression occurs preferentially in a specific cell it 
expression in this cell type is significantly higher than expression in other cell types. Tne 
terms "promoter" and "reeulatory element" also encompass so-called "leaky promoters 
and "reaulatorv elements", wh.cn regulate expression of a selected DNA primarily in one 
- tissue, but cause expression in other tissues as well. The terms -promoter" and -regulatory 
element" also encompass non-tissue specific promoters ana regulatory cements, i.e., 
promoters and regulatory elements which are active in most cell types. Furthermore, a 
promoter or regulator, element can be a constitutive promoter or regulatory element, i.e.. a 
promoter or reeulatorv element which const.tutively regulates transcription, as opposed to 
30 a promoter or regulatory element which is inducible, i.e.. a promoter or regulatory element 
which is active pnmariiv in response to a stimulus A stimulus car. be. e.g.. a mo.ecuie. 
such as a hormone, a cytokine, a heavy metal, phorbol esters, cyclic AMP (cAMP,. or 
retinoic acid. 

The term "core promoter element" is intended to include tne TATA box and the 
35 initiator element. 

"DNA recocn.tion sequence" or "DNA recognition element", as those phrases are 
used herein, mear. a DNA sequence which is capable of binding to one or more DNA- 
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binding domains, e.g.. of a transcription factor. 

The term "initiator" refers to a short, weakly conserved element that encompasses 
th- transcription start sue and which is important for directing the synthesis of pro P eri> 
initiated transcripts. While not wishing to be bound by any one theory, « is thought tnat 
TFIID contacts the initiator. 

The term "transcription factor" refers to any protein or modified form thereof that 
is involved in the initiation of transcription but which ts not itself a pan of the polymerase. 
Transcription factors are proteins or modified forms thereof, which interact preferentially 
with specific nucleic ac.d sequences, i.e.. regulatory elements, and which in appropnate 
, conditions stimulate transcription ("transcriptional activators") or repress transcr.pt.on 
("transcriptional repressors") Some transcription factors are active when they are in the 
form of a monomer Alternatively, other transcription factors are active in the form ot 
oligomers consisting of two or more identical proteins or different proteins (heterodimer) 
The factors have different actions curing the transcription initiation: they may interact with 
s other factors, with the RNA polymerase, with the enure complex, with activators, or with 
DNA. The factors are generally classifiable into two groups: (i) the general transcription 
factors, and (ii) the transcription activators. Transcription factors usually contain one or 
more regulatory' domains 

Tne term "regulatory domain" refers to any domain which regulates transcription. 
0 and includes both activation and repression domains. The term "activation domain- 
denotes a domain in a transcription factor which positively regulates (increases , the rate ot 
gene transcription The term "repression domain" denotes a domain in a transcription 
factor which negatively regulates i inhibits or decreases) the rate of gene transcription. 

The term genera! transcription factor" used interchangeably herein with tne tern: 
;j "GTF" and with "basic transcription factor" refers to proteins or protein complexes wn.cn 
work in concen with RNA Polymerase II to bring about promoter recognition ana accurate 
transcription initiation These prote.ns constitute, together with the RNA polymerase 1!. 
the Transcription Initiation Complex. GTFs include TF1IA. TFI1B. TFIID. TFIIE. TFIIF. 
and TF11H. These GTFs are usually sufficient to direct basal levels of transcription m 
30 vitro from strong promoters .i.e.. those containing TATA boxes 1 ,. Several GTF interact 
with one another andor with RNA Polymerase II. For example. TFIIE interacts witr. 
TFIIH and RNA Polymerase II. TF11 F interacts with RNA Polymerase II and with TFIIB. 
and TFII B interacts with TBP from TFIID and RNA Polymerase II. 

The term "transcriptional activator" as used herein refers to a protein or protein 
3< complex which is capable of enhancing the efficiency with which the basal transcription 
complex performs, i.e.. activating transcription. Thus, as used herein, a transcriptional 
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activator car, be a s.neie prote.n or alternatively it can be composed of several umts at least 
some of which are not covalently linked to each other. A transcriptional activator typically 
has a modular structure, i.e.. comprises various domains, such as a DNA binding domain, 
and one or more transcriptional activation tags. Some transcnpuonaJ actuators may 
< engage a subset of GTFs. For example, some transcriptional activators may include 
activation tags which collectively contact both TFIID and TFI1B Other transcriptional 
activators may contact a single GTF. 

The term "cefaclor" which is used interchangeably herein with the terms "co- 
activator" • adaptor" and "mediator" refers to prote.ns which either enhance or repress 
,0 transcr.pt.on in a non-gene specific manner, e.g.. which lack intrinsic DNA bind.ng 
specificity Thus, cofactors are general effectors. Positively acting coiactors do not 
stimulate basal transcription, but enhance the response to an activator. Positively acting 
cofactors include PCI . PC2. PC3. PC4. and ACF. TAFs wh.ch interact directly .with 
transcriptional activators are also referred to as cofactors. 
1 s The term, "transcriptional activation tag", also referred to here.n as "activation tag", 

"transcriptional actuation unit" and "activation unit", refers to a peptide sequence which is 
capable of inducine or otherwise potentiating activator-dependent transcription, either on 
,ts own or when linked covalently or non-covaiently to another transcriptional activation 
unit As opposed to a transcriptional activator generally, an activation tag corresponds to a 
^0 minimal polypeptide sequence which retains the ability to interact directly or indirectly 
with a transcription factor Of course, unless otherwise clear from the context, where a 
chimeric protein is referred to as "including" or "comprising" an activation tag. it will oe 
understood that other pontons of the protein from which the tag is derived can be included. 
Transcriptional activation tags can be rich in certain amino acids. For example, a 
-< transcriptional activation unit can be a peptide nch in acid>c residues, glutamme. proline. 

serine and threonine residues. Yet other transcriptional activators can be rich in 
isoleucine or basic amino ac.d residues (see. e.g.. Tnezenberg (1995) Cur Opin Gen. 
Develop. 5:190. xnd references therein!. For instance, an activation tag car. be a peptide 
motif of at least about 6 amino acid residues associated with a transcription activation 
-.0 domain, includinc the well-known "acidic", "giutamtne-nch" and "proline-nch" motifs 
such as the K13 mot,: from P 65. the OCT2 0 domain and the OCT2 P domain, 
respectively. 

A "dimenzation doma.n" is defined as a domain that induces formation of aimers 
between two prote.ns having that doma.n. while a "tetramenznt.on domain" ,s defined as a 
»< domain that induces formation of tetramers amongst prote.ns ccnta.n.ng tne 
tetramenzation doma.n. An "oi.gomenzat.on doma.n", generic for both dimenzation and 
tetramenzation domains, facilitates formation of ol.gomers. wh.ch can be of any subur.it 
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stoiechiometry (of course greater than one). 

The term "interact" as used herein is meant to include detectable interactions 
betw-n molecules, such as can be detected using, for example, a yeast two hybrid assay 
or bv immunocrecioiwtion. The term interact is also meant to include "Binding 
interactions between molecules. Interactions may be, for example, protein-protein. 
protein-nucleic acid, protein-small molecule or small molecule-nucleic acid m nature. 

The term "holoenzyme complex" refers to RNA Polymerase [1-containing 
complexes. 

The term -squelching'- which is used interchangeably herein with the term 
• activator interference- refers to the inhibition of transcription observed when an activator 
is rresC nt at artificially high concentrations (Ptashne and Gann (1990) Nature 346:3:9!. 
While not bound bv am particular theory, this inhibition is understood to result from the 
sequestration bv the activator (unbound to DNA) of either U) an adaptor component 
necessary to bridge the activator wnth the basal machinery, and/or (ii) a basal component. 

The term, subumf. when referring to the subunit of a transcriptional activator, 
refers to any unit of the transcriptional activator, e.g.. a transcriptional activation unit, a 
DNA binding domain, or a ligard binding domain. 

The term. "unit", when referring to a unit of a transcription factor, refers generally 
to a minimum portion of a transection factor having a specific activity, e.g.. 
20 transcriptional activation, transcriptional repression. DNA binding, or ligand binding. 

As used here.n. the term ' transfection" means the introduction of a nucleic acid, 
e g an expression vector, into a rec.p.ent cell by nucleic acid-mediated gene transfer. The 
term "transduction" is generally used herein when the transfection with a nucleic acid is bv 
viral deliverv of the nucleic aod. •'Transformation", as used herein refers to a process in 
25 which a cell's genotype is changed as a result of the cellular uptake of exogenous DNA or 
RNA and. for example, the transformed cell expresses a recombinant form of a 
poivoeptide or. m the case of anti-sense expression from the transferred gene, the 
expression of a naturally-occurring form of the recombinant protein is disrupted. 

As used herein, the term, "transgene" refers to a nucleic acid sequence which has 
-, 0 been introduced into a cell. Daughter cells deriving from a cell in which a transgene has 
been introduced are also said to contain the transgene .unless it has been deleted. A 
transgene can encode, e.g.. a polypeptide, partly or entirely heterologous, i.e.. foreign, to 
the transeemc animal or cell into wh.cn it is introduced, or. is homologous to an 
endogenous gene of the transgenic ammal or cell into which it is introduced, bu, wn.cn ,s 
desiened to be inserted, or is inserted, into the animal's genome in such a way as to alter 
the genome of the cell into which it is inserted .e.g.. it is inserted at a location which 
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difTers from thai of the natural gene). Alternatively, a transgene can also be present in an 
episome. A transgene can include one or more transcriptional regulatory sequences and 
any other nucleic add. .e.g. intron), that may be necessary for optimal express.on of a 
selected coding sequence. 

As used herein, the term "vector" refers to a nucleic acid molecule capable of 
transport.ng another nucleic acid to which it has been linked. One type of preferred vector 
is an episome. i.e.. a nucleic acid capable of extra-chromosomai replication. Preferred 
vectors are those capable of autonomous replication and' or expression of nucleic acids to 
which thev are linked Vectors capable of directing the expression of genes to which they 
,0 are operat'ively linked are referred to herein as "expression vectors". In general, expression 
vectors of utility m recombinant DNA techniques are often in the form of "plasmids" 
which refer genera lly to circular double stranded DNA loops which, in their vector form 
are not bound to the chromosome. In the present specification, "plasmid" and "vector" are 
used interchangeably as the plasmid is the most commonly used form of vector However. 
15 the invention is intended to include such other forms of expression vectors which serve 
equivalent functions and which become known in the art subsequently hereto. 

"Derived from" as that phrase is used herein indicates a peptide or nucleotide 
sequence selected from within a given sequence. A peptide or nucleotide sequence derived 
from a named sequence may contain a small number of modifications relative to the parent 
20 sequence, in most cases representing deletion, replacement or insemon of less than about 
15%. preferably less than about 10%. and in many cases less than about 5%. of amino 
acid residues o: base pairs present in the parent sequence. In the case of DNAs. one DNA 
molecule is also considered to be derived from another if the two are capable of selectively 
hybridizing to one another. 

The terms "chimeric'", "fusion" and "composite" are used to denote a protein, 
peptide domain or nucleotide sequence or molecule containing at least wo component 
portions which are mutually heterologous in the sense that they are not. otherwise, found 
directly (covalently. linked in nature. More specifically, the component portions are not 
found in the same continuous polypeptide or gene in nature, at leas; not in the same order 
30 or orientation or with the same spacing present in the chimeric protein or composite 
domain. Such materials contain components derived rrom at least two different proteins or 
genes or from at least two non-adiacent portions of the same protein or gene. Composite 
prote.ns. and DNA sequences which encode them, are recombinant in the sense that the> 
contain at least two consutuen: portions which are not otherwise found directly linked 
35 (covalently ) together in nature. 
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ill Fiemplan Chimeric A ctivators 

In one aspect, the invention provides composite activators comprised of a 
multiplicity of activation tags fused to. e.g.. a DNA-bindmg domain, a l.eand binding 
domain and/or an oligomerization domain. 

A composite transcription activation region consists of a continuous polypeptide 
region containing two or more reiterated or otherwise heterologous activation tags The 
activation ta E s comonse polypeptide sequences derived from at least mo dirlerent 
proteins, polypeptide sequences which do not ordinarily occur in the same orientation 
relative to one another (including reiterated copies of a polypeptide sequence), or 
10 polypeptide sequences which do not occur in nature. 

In one embodiment, at least one of the activation tags recruits TFIIA to a TFIID 
complex to form, a "DA" complex, and/or stabilized the formation of a DA complex. 
Other activation tags recruit, or stabilize, complexes including other TAFs and co- 
activators reauired for activator-dependent transcription. For instance, in certain cases the 
,« composite activator sequence may recru.t TFIIA and TF1IB, TFIIE. TFIIF. or TFIIH. in 
a preferred embodiment, the composite activator includes at least two different activation 
tags from the group of acidic activation tags, prolme-nch transcription actuation tags, 
senne'threonine-nch activation tags, glutamine-nch activation tags, and AP activation 
tags, and even more preferably, at least two of those activation tags are selected from 
20 disparate proteins u.e.. that do not naturally occur together in the same protein;. 

Where multiple copies of a particular activation tag are included in the same 
contiguous polvpeotide. the composite activator preferably includes at least 3 con.es of the 
activation tag. but more preferably at least 5. 10. 15. or even at least 20 copies of the tag. 

Shown m Figure 1 1 is a representative example of the subject composite activators 
- In the exemplary composite activator, two or more of the activation lags are from different 
proteins The AP activation sequence and the p65 (450-550) activation sequence are 
derived from the human NF-lB p65 subumt. whereas the V8 domain is an aciaic activation 
tag from the N-term.nus of VP 16. and the Vc activation tag is a C-term.nal portion of 
VP! 6 Each of the b65<450-550\ V8. and Vc sequences are capable of inducing activator- 
30 deoendent transcription. Presented in the same polypeptide sequence, these activation tags 
induce expression of a responsive gene in a synergistic fashion On the other hand, tne AP 
activation tac on its owr. is insufficient to induce activator-dependent transection, but 
when combined with an activation tag such as the V8 activation tag. the nreser.ee ot tne 
AP activation tac increases transactivation of the gene relative to tne V8 activation tag 
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Another feature of the illustrated composite activator is the repetition of certain cl 
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the activation tag sequences. Renting the V8 and Vc activation tags, for example, 
increases the fortiiude of the resulting protein as a transcriptional activator, particularly o! 
genomicaJly integrated genes. 

^s described m more detail below, the ideal number of activation tags to include. 
< and Lhe relative orientations of the various tags to one another in the composite activators, 
can be readily determined by techniques known in the an. including high througnput 
screening using reporter gene-based assays. 

Based at least :n nan on the lack of squelching observed from the instant composite 
activators the subiect fusion proteins can include one or more ohgomcnzat.on sequences 
,0 which nertnus non-covaient oligomer.zatton of multiple composite actuators For 
mstance. as Fisure 3C shows, including a tetramer.zation domain permits the formation ot 
multimenzed "complexes of the composite activator sequences. Reiterating from above, 
this embodiment derives in pan from the discovery that squelchmg due to abunaance of 
act.vat.on tags, particularly for forming the DA complex, is not a hindrance .n expression 
15 from genomic sequences. 

^s desired, the comrosite activation sequence can be provided as pan of a fusion 
-rotein including a DNA binding domain. In other embodiments, such as illustrated in 
Figure 3. the comnosite activation sequence can be fused with a ligand b.no.ng domain 
which in the presence of a multivalent ligand. can facilitate recruitment of me composite 
20 activator to a DNA-bound complex. Tne complex can be loaded with multiple activators, 
in a ligand-dependent manner, by inclusion of multiple ligand binding domains. 

In the instance where the composite activator contains one or more oiigomenzation 
domains and'or ligand binding domains, but is not contiguous with a DNA 'ending 
domain, the composite activator can be coexpressed in cells w.th a second protein 
25 including a DNA binding doma.n and appropriate oiigomenzation or ligand binding 
domains to form complexes w.th the composite activator proteins. Thus, composite 
activator proteins can be recruited to a site of transcriptional reguiauon by interaction with 
a DNA binding protein by oiigomenzation. which may be constitutive or inducible. 

Techniaues for making the subiect fusion proteins are adapted from well-knoNvn 
30 procedures. Essentially, the joining of various DNA fragments coding tor different 
coivoentide sequences is performed in accordance with conventional techniques, 
emplovine blunt-ended or stagger-ended termini for ligation, restriction enzvme digestion 
to provide for appropriate termini, filling in of cohesive ends as appropriate, alkaline 
Phosphatase treatment to avoid undesirable joining, and enzymatic ligation. Alternatively. 
t< the fus.on gene can be synthesized by conventional techniques including automated DNA 
synthesize*. In another method. PCR amplification of gene fragments can be carr.ee out 
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using anchor primers which give rise 10 complementary overhangs between two 
consecutive sene fragments. Amplification products car. subsequently be annealed to 
generate a chimeric gene sequence (see. for example. Current Protocols tn Molecular 
Biology, Eds. Ausabel et al. John Wiley & Sons: 1992V 



A. Activation Tags 

The activation tags useful in the composite activators of the present invention can 
be derived from one or more transcription factors. Polypeptides which can function to 
activate transcription in eukaryotic cells are well known in the an. In particular, 
transcriptional activation domains which contain suitable activation tags have beer, 
described for many DNA binding proteins and have been shown to retain their activation 
function when the transcriptional activation domain, or a suitable fragment thereof, is 
transferred to a heterologous protein. 

Activation tags can be naturally occurring or can be synthetic, so long as. either 
alone or in combination with other activation tags, they are capable of enhancing the 
efficiency with which the basal and/or activator-dependent transcription complex 
performs, i.e.. so long as the> are capable of interacting with a transcription factor or co- 
activator protein Any parucuiar activation tag is preferably at least 6 amino ac.ds in 
length, and preferably contains no more than about 300 amino acid residues, though even 
more preferably .ess than. 200 or even less than 100 residues. 

Naturalh occumnc activation units include portions of transcription factors, such as a 
thirty amino acid fragment of the C-termmus of VP16 (amino acids 461-490). referred to 
herein as "Vc". 

Other actuation units are derivatives of naturally occurring peptides For example, 
the replacement of one amino ac.d of a naturalh occurring actuation unit by another may 
further increase activation An example of such an activation unit is a derivative of an 
e,ght amino acid peptide of VP 16. the derivative having the ammo ac.d sequence 
DFDLDMLG. 

Yet other activation units are entirely synthetic. It is known, for example, mat 
certain random alignments of acidic amino acids are capable of activating transcription. 
It is well known m the art that certain transcription factors are active only in specific cell 
tvpes i.e., that transcription factors can act in a tissue specific manner. W ithout wanting 
to' be limited to a specific mechanism of action, it is possible that thus tissue specificity 
results from the fact that the transcription factor interacts with specific factors, e.g. 
cefaclors, which are present only m certain cell types This tissue specifics can be 
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localized to a specific portion of the transcnpt.on factor. In certain transcnption factors, 
this portion is located outs.de activation domains, whereas in other transcnption factors. 
*h.s portion mav be localized within an activation domain. Thus, by using activation tags 
which are functional essentially in specific cells, it is possible to design a transcriptional 

5 activator of the invention having a cenain tissue specificity. 

A preferred pohpept.de for use in the fusion protein of the invention is tne herpes simplex 
v,rus virion protein 16 (referred to herein as VP 16, the amino acid sequence of which is 
disclosed in Tnezenberg. S.J. et al. (1988) Genes Dev. 2:718-7291 In one embod.ment. an 
activation tag corresponding to about 127 of the C-terminal amino ac.ds of VP 16 ,s used. 

0 For example, a roivpeptide having an amino acid sequence shown in SEQ ID NO: X 
.positions 208-335) can be used as the second polypeptide in the fusion protein. In 
another embodiment, at least one copy of about 1 1 amino ac.ds from the C-terminal region 
of VP16 which retain transcriptional activation ability is used as the activation tag. 
Preferably an oligomer of this region (i.e.. about 22 ammo acids) is used. Suitable C- 

5 terminal peptide portions of VP16 are described in Seipel. et al. (EMBO J (1902) 
13:4961-4968) 

Another example of an acidic activation tag is provided in residues 753-881 of 

GALA 

One particularly important source of transcription activation tags which are 
^0 featured in a number of embodiments of the invention is is the (human) NF-kB subun.t 
P 65 In one emrodiment the chimeric activator contains one or more copies ot a peptide 
sequence comensme all or part of the p65 sequence spanning residues 450-550. or a 
peptide sequence derived therefrom. In certain embodiments, it has been round mat 
extending the P 65 peptide sequence to include sequence spanning P 65 residues 361 -4?C. 
;< e.g.. including the "AP activation tag", leads to an unexpected increase in transcription 
activation. Moreover, a peptide sequence comprising all or a pon.on of P 65(36i -550.. or 
peptide sequence derived therefrom, in combination with heterologous activation tags, can 
vield surprisinc additional increases in the level of transcnption activation P 65 -based 
activation domains function across a broad range of promoters and have yielded increases 
,0 in transcription levels six-fold, eight-fold and even 14-15-fold higher than obtained with 
tandem copies of VP 16 which itself is widely recognized as a very potent activation 
domain. 

While the resultant increases in activation potency are dramatic. p65-based 
transcnption factors possess additional and unexpected charactensucs. For instance, unlike 
;< VP 16. the subject P 65-based activators do not appear to be toxic to the engineered cells. 
Tnis is ciearlv c: profound practical significance in many applications. It is expected that 
recombinant DNA molecules encoding chimenc proteins which contain a p65 activation 
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tag. or peptide sequence derived therefrom, will provide significant advantages for 
heterologous gene expression in its various contexts, including constitutive systems such 
as described herein, as well as in regulated systems such as described in Intemauonai 
patent applicat.ons PCTXS94/01617. PCT/US95/10591. PCT<US96-09948 and the like. 
< as well as in other heterologous transcription systems such as those involving teiracyciin- 
based regulation reported by Buiard et al. and those involving steroid or other hormone- 
based regulation. 

One class of p65-based transcription factors contain more than one copy of a p65- 
denved domain. Such proteins will typically contain two to about six copies of a peptide 
,0 sequence comprising ai! or a portion of p65(361-550). or peptide sequence derived 
therefrom 

Other polypeptides with transcriptional activation ability in eukaryonc cells can be 
used to provide activation tags for the fusion protein of the invention. Transcriptional 
activation domains found within various proteins have been grouped into categories cased 
!5 upon similar structural features. Types of transcriptional activation domains include, in 
addition to the acidic transcription activation domains, prohne-rich transcription activation 
domains, serine threonine-rich transcription activation domains and glutamine-nch 
transcription activation domains Examples of proline-nch activation domains include 
ammo acid residues 39Q-49Q of CTF/NF1 and amino acid residues 31-76 of AP2 
20 Examples of serine -threonine-rich transcription activation domains include amino acid 
residues 1 -427 of IT? 1 and amino acid residues 2-45 1 of ITF2. Examples of glutamine- 
nch activation domains include ammo acid residues 175-269 of Oct! and ammo acid 
residues 132-243 of Spl. The amino acid sequences of each of the above described 
regions, and of other useful transcriptional activation domains, are disclosed in Seipel. K 
25 etal.fEMBO J. ( 1 992 » '.3 4961-49681 

Still other illustrative activation aomains and motifs of human origin include the 
activauon domain of human CTF. the 18 ammo acid (NFLQLPQQTQGALLTSQP> 
glutamine rich recion of Oct-2. the N-termmal 72 ammo acids of p53. the SYGQQS repeat 
m Ewmg sarcoma gene and an 11 amino acid (535-545. acidic rich region of Re! A 
30 protein. 

In addition to previously describee transcriptional activation domains, novel 
transcriptional activation tags, which car. be identified by standard techniques, are wnthm 
the scope of the invention. The transcriptional activation ability of a polypeptide can be 
assayed bv linking the polypeptide to another polypeptide having DNA binding activit> 
3 5 and determining the amount of transcription of a target sequence that is stimulated by the 
fusion nrotein. For example, a standard assay used in the art utilizes a fusion protein ot a 
putative activation tag and a GAL4 DNA binding domain (e.g.. ammo acid residues 1-Q3) 
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Tnis fus.on protein is then used to sumuiate expression of a reporter gene linked to GAL4 
bmding sues (see e.g.. Seipel. K. el al. (1992) EM BO J. 1 1 :496i-4968 and references citee 
therein). 

As set out above, transcnpuonal activators can also compnse activation tags, x*hich 
< i„ the absence of other activation tags are incapable of activating transcnpt.on For 
example, the activation tag AP from VP 16 does not activate transcription in the absence oi 
other activauon units, even when present in multiple copies. 

According, the invention provides an AP activauon tag. or derivative thereof, 
which mav interact with TFIIA. but which on its own is incapable of activating 
i0 transcription. In one embodiment, the AP polypeptide has the am.no acid sequence of 
residues 361-450 cf SEQ ID NO.2. or a fragment thereof Functionally equivalent 
derivatives of the AP activation tag can be obtained, for instance, by screening derivatives 
of AP for bindine to TF11A and measuring transcriptional activity of such a derivative in a 
co-transfect.on assav. as described above. Such equivalents are expected to include forms 
• 5 of the activation tag which axe tuncated a. either the N-term.nus or C-term.nus or ooth. 
e g fragments of P 65 <or homoloeous sequences thereto) which are about 7a. 60. 50. ,0 
or ever 20 amino acid residues in length (e.g., ranging in length from 20-89 amino acids). 
Likew.se it is exacted that the AP activation tag sequence from p65 can tolerate amino 
acid substitutions, e.g.. to produce AP tags of at least 95%. 90%. 80% and even 70% 
20 identity with the AP activation tag sequence of SEQ ID No. 2 

The multiple activation units and other domains of the transcriptional of the 
invention can be from anv eukaryotic spec.es, and it is not necessary that ever, unit or 
domam be from the same spec.es. Preferred spec.es include vertebrates, such as 
mammals. Even more prefened units or domains are from humans. For use of the 
^ transcriptional activators of the invention in gene therapy in a subject of a specific species, 
e.g.. human, it is preferable to use units and domains from the same species to avo.G 
immune reactions against the transcriptional activator or complex 

The activation units of a transcriptional (reactivator can be covaientiy linked to 
each other in a linear array, i.e.. the NH2-terminus of one activation unit is hnked to the 
COOH-term.nus of another activation unit. The activation units can be arranged in any 
orde- However, as described herein, it has been observed, that certain arrangements o: 
activation units results in higher levels of transcriptional activation man other 
arrangements. The order in which the activation units should be arranged will depend on 
the result desired, i.e.. the degree of transact! v anon that one desires to achieve, and can be 
de-errmned e.g.. bv performing cotransfection experiments, as descnbed in the Examples. 
Bneflv expression vector encoding the activauon units in various arrangements linked to a 
DN A binding domain are cotransfected together with a reporter construct containing a 
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reponer eene ocerabiy l.nked to a promoter containing a DNA sue recognized by the DNA 
binding domain, and expression of the reporter gene is measured. For such assays, it is 
preferable to use a ceil line in which the activation units are known to be active. 

In order to optimize the transcriptional activity of a composite activator of the 
« subject invent.cn. it can be advantageous to shuffle the activation tags relative to one 
another in polvpeptide. There are a variety of combinatorial techniques ava.iaoie wn.cr. 
can be adapted for creating combinatorial libranes of the subject composite activators o> 
creating and sampling libranes of proteins in which the activation tags are shuffled in the 
polvpeptide to some degree relative to one another. 
,0 In an exem F iarv embodiment, the coding sequences for individual activation tags 

are synthesized to include a Notl cleavage site at both the 5. and 3i end by addition of the 
sequence GCGGCCGCN. which aiso encodes (Ala)3, an unstructured linker. The benefits 
of unstructured linkers are descnbed below. A variety of nucleic acids encoding such 
activation tag sequences can be treated with Notl. admixed with one another, and religated 
,5 to form a combinatorial library representing various positional combinations of tne 
different activation tag sequences relative to each other. Those combinations ) of 
activation tags producing the greatest level of transcriptional activation can be ready 
identified in the resulting combinatorial library, e.g.. by selections based on expression ot 
a quantifiable reporter gene. 
-0 In another embodiment, combinatorial intron splicing can be used to generate a 

diverse library of composite activation sequences. U.S. Patent 5,498.531 describes a 
means for earning out the equivalent of "exon shuffling- by intron-med.ated trans- 
splicing. Bnefly. the trans-splicing system of the e531 patent provides an active set ct 
transcripts for trans-splicmg wherein flanking intronic sequences can interact ir. ar. 
^ intermodular reaction to reconsitute a reactive complex which promotes 
transesterification of two or more transcripts. RNA transenpts are derived which induce 
an -exon" sequence, e.g.. encoding an activation tag. which is flanked on each side o> 
intron sequences that can direct trans-splicing of the exon sequences to each other 3y 
admixing these RNA constructs, .mermolecula: complementation between the t.arjcmg 
30 intron seauences of two different constructs forms a functional intron which mediates the 
transestenf.cation reactions necessary to hgate two discontinuous activation tag sequences 
to one another, and thereby generate a elongated transcript comprising these concatenated 
RNA sequences encoding activation tag sequence. In one embodiment, the activation tag 
"exons" are flanked bv portions of one of a group I or group II intron. such that the 
-.5 interaction of the flanking intromc sequences is sufficient to produce an autocatalyt.c core 
cacable of driving ligation of the exons in the absence of any other factors In the present 
context, the term "exon- merely denotes nucleic acid sequences encoding polypeptides 
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including an activation tag. and can. for instance, correspond to naturally and non- 
naturally occurring exon sequences. The resulting RNA transcripts are reverse 
transcribed, and as above, the positional permutations of greatest transcriptional activity 
are easilv isolated. 
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B. DNA binding domain 

In certain embodiments of the invention, the composite activator is provided as 
pan of a chimeric protein which further composes a DNA-binding domain. In other 
embodiments, the composite activator is localized to a DNA sequence by virture of a 
constitutive or inducible oligomerization with chimeric DNA binding domain In such 
instances, the DNA binding domain can be provided in a fusion protein which one or more 
oligomerization domains or ligand binding domains. The choice of component DNA- 
binding domains may be influenced by a number of considerations, including the species, 
system and cell type to which is targeted; the feasibility of its incorporation into a chimeric 
protein, as may be shown by modeling: and the desired application or utility. 

The DNA binding domain can be a naturally occurring DNA-binding domain from 
a transcription factor Alternatively, the DNA binding domain can be an artificial (or 
partially artificial i polypeptide sequence having DNA binding activity. For example, the 
DNA-binding domain can be a naturally occurring DNA binding domain that has been 
modified to recognize a different DNA binding site. The particular DNA-binding domain 
chosen will depend on the target promoter. For example, if the gene to be transcriptionally 
activated by the subject method is an endogenous gene, the DNA-binding domain must be 
able to interact with the promoter of the endogenous gene .endogenous promoter:. 
Alternatively, as described in greater detail below, the endogenous promoter could be 
.5 replaced, e.g.. by homologous recombination, with a heterologous promoter for which the 
DNA binding domain is selected. Such a substituuon may be necessary if no transcription 
factor is known to bind the endogenous promoter of interest. Alternatively, in such a 
situation, it is also possible to clone a DNA-binding domain interacting specifically with a 
sequence in the promoter of interest. Tnis can be done. e.g.. by phage display screening 
30 with a DNA molecule comprising at least a portion of the promoter ot interest. 

Desirable properties of DNA binding domains include high affinity for specific 
nucleotide sequences, termed herein "target sequences", low affinity for most other 
sequences in a complex genome isuch as a mammalian genome!, low dissociation rates 
from specific DNA sues, and novel DNA recognition specificities distinct from those of 
35 known natural DNA-binding proteins. Preferably, binding of a DNA-b.nd.ng domain to a 
specific target sequence is at least two. more preferably three and even more preferably 
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more than four orders of magn.tude greater than binding to any one alternative DNA 
sequence, as mav be measured by relative Kd values or by relat.ve rates or levels ol 
transcription of genes associated with the selected and any alternative DNA sequences. It 
is also preferred that the selected DNA sequence be recognized to a substantially greater 

s degree bv the DNA binding domain of the trancriptional activator of the invention than by 
an'endogenous protein. Tnus. for example, target gene expression in a cell is preferably 
two. more preferably three, and even more preferably more than four orders of magnitude 
greater in the presence of the transcriptional activator of the invention containing a DNA- 
bmding region than m its absence. 

,0 Preferred DNA binding domains have a dissociation constant for a target sequence 

below 10-8 M . preferably 10"" M. more preferably below 10-1° M . eve n more preferably 
below 10-" M- For gene therapy applications, they are preferably derived from human 
proteins. 

From a structural perspective. DNA-binding that can be used in the invention may 
,5 be classified as DNA-binding proteins with a helix-turn-helix structural design, such as. 
but not limited to. Myb. Ultrabithorax. Engrailed. Paired. Fush. tarazu. HOX. L'nc86. the 
Ets and homeobox families of transcription factors, and the previously noted Octl. Oct: 
and Pit. zinc finger proteins, such as Zif268. SWI5. Rr.ppel and Hunchback; steroid 
receptors; DNA-b,nd,n C proteins with the helix-loop-helix structural design, such as 
20 Dauehterless, Achaete-scute (T3 ». MyoD. El 2 and E47; and other helical motifs like the 
leucine-zipper, which includes GCN4. C/EBP. c-Fos/c-Jun and JunB. The ammo acid 
sequences of the component DNA-binding domains may be naturally-occumng or non- 
naturallv -occurring -or modif.edV DNA-binding domains and their target sues can be 
found at TF SEARCH ihttp: "www. genome. adtjp. SIT TFSE ARCH html) Another 
25 publicly available database of transcription factors and the sequences to which they bind is 
available from the National Library of Medicine in the "Transcription Data Base - 
One strategy for obtaining component DNA-binding domains with properties 
suitable for this invention is to modify an exisung DNA-binding domain to reduce its 
affinity for DNA into the appropriate range. For example, a homeodoma.n such as that 
;,0 derived from the human transcription factor Phox 1 . may be modified by substitution of the 
glutamme residue at position 30 of me homeodoma.n. Substitutions at this position 
remove or change an important point of contact between the protein and one or two base 
pairs of die 6-bp DNA sequence recognized by the protein Thus, such substitutions 
reduce the free energy of binding and the affinity of the interaction with this sequence and 
35 mav or mav not simultaneously increase the affinity for other sequences. Such 3 reduction 
in affinitv is sufficient to effectively eliminate occupancy of the natural target site by this 
protein when produced at typical levels m mammalian cells But it would allow this 
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domain to contribute binding energy to and therefore cooperate with a second linked 
DNA-binding domain. Other domains that amenable to this type of manipulation include 
the paired box. the zinc-finger class represented by steroid hormone receptors, the myb 
domain, and the ets domain. 

In another embodiment, the DNA binding domain is created from the assembly of 
DNA binding domains from various transcription factors, resulting in a DNA binding 
domain having a novel DNA binding specificity. Such DNA binding domains, referred to 
herein as "composite DNA binding domains" can be designed to specifically recognize 
unique binding sues For example, a DNA binding domain car. be constructed that 
10 compnses DNA binding regions from a zinc finger protein and a homeobox protein. One 
such DNA binding domain is ZFHD1. a composite DNA binding domain comprising an 
Oct-1 homeodomair. and zinc fingers 1 and 2 of Zif268. which is further desenbed in PCT 
Application WO 96 2095 1 by Pomerantz et al. 

The DNA sequences recognized by a chimenc protein containing a composite 
15 DNA-binding domain can be determined experimentally, as described below, or me 
proteins can be manipulated to direct their specificity toward a desired sequence. A 
de Sl rable nucleic ac:d recognition sequence consists of a nucleotide sequence spanning at 
least ten. preferably eleven, and more preferably twelve or more bases. The component 
binding portions (putative or demonstrated) within the nucleoude sequence need not be 
20 fully contiguous: they may be interspersed with "spacer" base pairs that need not be 
directly contacted by the chimeric protein but rather impose proper spacing between the 
nucleic acid subsites recognized by each module. These sequences should not impart 
expression to linked ger.es when introduced into cells in the absence of the engineered 
DNA-binding protein 

; 5 To identity a nucleotide sequence that is recogmzed by a transcriptional activator 

protein containing the composite DNA-binding region, preferably recognized with high 
affinity (dissociation constant 10-" M or lower are especially preferred), several methods 
can be used. If high-affinity binding sites for individual subdomains of the composite 
DNA-binding region are already known, then these sequences can be joined with vanous 

30 spacing and orientation and the optimum configuration determined expenmentaliy ;see 
beiowfor methods for determining affinities i. Alternatively, high-affinity binding sites 
for the protein or protein complex can be selected from a large pool of random DNA 
sequences by adaptation of published methods f Pollock. R. and Treisman. R.. 1990. A 
sensitive method for the determination of protein-DNA binding specificities. Nucl. Acids 
35 Res. 18. 6197-620-1) 3ound sequences are cloned into a plasmid and their precise 
sequence and affinity for the proteins are determined. From this collection of sequences, 
individual sequences with desirable characteristics (i.e.. maximal affinity for composite 
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protein, minimal affinity for individual subdoma.ns) are selected for use. Alternatively, the 
collection of sequences is used to derive a consensus sequence that cames the favored base 
pa,rs at each position. Such a consensus sequence is synthesized and tested (see below, to 
confirm that it has an appropriate level of affinity and specificity. 

A number of well-characterized assays are available for determining the binding 
affinitv. usually expressed as dissociation constant, for DNA-binding proteins and the 
cognale DNA sequences to which they bind. These assays usually require the preparation 
of "purified protein and binding sue (usually a synthetic oligonucleotide! of known 
concentration and specific activity. Examples include electrophoretic mobility-shin assays. 
i3 DNasel protection or -footpriming". and filter-binding. These assays can also be used to 
get roueh estimates of association ana dissociation rate constants. These values may be 
determined with greater precision using a BlAcore instrument. In this assay, the synthetic 
oligonucleotide is bound to the assay "chip.- and punned DNA-binding protein is passed 
through the fiow-cei: Binding of the prote.n to the DNA immob.lized on tne ch. P is 
,< measured as an increase :n refractive index. Once protein is bound at equilibnum, butter 
without protein is passed over the chip, and the dissociation of the prote.n results in a 
return of the refractive index to baseline value. The rates of association and dissociation 
are calculated from these cun.es. and the affinity or dissociation constant is calculated 
from these rates. Binding rates and affinities for the high affinity composite site may be 
^0 compared with the values obtained for subsites recognized by each subdomain or the 
protein. As notec above, the difference in these dissociation constants should be at least 
two orders of magnitude and preferably three or greater. 

The invention farther provides composite activators of the present invention 
provided as a his.on prote.n with an inducible DNA binding domain, si In one 
25 embodiment, the inducible DNA binding domain is the E. col. tet repressor (TetR). which 
binds to tet operator [tetO) sequences upstream of target genes. In the presence ot 
tetracycline, or an analog, which bind to tetR. DNA b.nd.ng is abolished and thus 
transactivation is abolished This system, in which the TetR had previously been linked to 
transcription activation domains, e.g. from VP16. is generally referred to as an allosteric 
-0 "off-switch" aescnbed bv Gossen and Buiard (Proc Natl. Acad. Sci. U.S.A. '1^:» 
89:554-) and in U.S. Patents 5.464.-58. 5,650.298; and 5.589.362 by Buiard et al. 
Funhermore, depending on the concentration of the antibiotic in the culture medium (0-1 
mu a/ml), target gene expression can be regulated over concentrations up to several orders 
of macnitude. Thus, the system not only allows differenual control of the activity ot an 
3. individual eene ,n eukarvotic cells but also is suitable for creation of "orvofT situations tor 
such cenes'in a reversible way. This system provides low background and relatively high 
target" eene expression in tne absence of tetracycline or an analog Thus, the invention 
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described herein provides a method for obtaining even stronger transcriptional induction of 
a target eene. which is reeulatable by the tetracycline system or other inducible DNA 
bindma domain. For example, a TetR can be linked to a multiplicity of transcription 
activation un.ts. such that high levels of transcription occur in the absence ot tetracycline 

5 or analog thereof and that transcription is repressed in the presence of tetracycline. 

In another embodiment, a "reverse" Tet system is used, again based or. a DNA 
b.nd.ne domain that ,s a mutant of the E. coli TetR, but which binds to TetO ir. the 
presence of Tet. Tnus. the invention described herein provides a method tor obtaining even 
stronger transcriptional induction of a target gene in the presence of tetracycline or an 

ic analog thereof from a very low background in the absence of tetracycline. 
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C. Oligoraerization domains 

As set out above, in various embodiments of the composite activators, tne fusion 
prote.ns can also include at least one oligomenzauon domain. Such a domain can be a 
constitutive oligomenzauon domain, or an inducible oligomenzauon domain, i.e.. a 
domain mediating ohgomenzat.on only in the presence of a third molecule, such as a 
small organic molecule Examples of constitutive oligomenzauon domains include 
leucine zippers. 

Example of inducible oligcmerizanon domains include FK506 and cyclosporin 
binding domains of FR506 binding prote.ns and cyclophilins. and the rapamycm binding 
domain of FRAP (tori i. Sucn inducible oligomenzauon domains are referred to nerem as 
-ligand binding domains" and are mnher described herein under the section entitled 
accordingly. 

In one embodiment of the invention, at leas; one activation tag or iigand binding 
domain or DNA binding domain is linked to a constitutive oligomenzauon domain, e g . a 
dimenzanon or tetramer.zation domain. A dimenzation domain is defined herein as a 
sequence ofam.no acids capable of forming homod.mers or heterodimers. One example 
of a d.menzation domain is the leucine zipper (LZ» element. Leucine zippers have been 
identified, generally, as stretches of about 35 amino acids containing 4-5 leucme residues 
separated from eacn other by six amino acids (Maniatis and Abel (1989) Nature 341:2-- 
Exemplar leucine zippers occur in a variety of eukaryouc DNA binding proteins, 
such as GCN4. CEBP. c-Fos. c-Jun. c-Myc and c-Max. Other dimenzation domains 
include hel.x-looo-helix domains (Murre. C. et al. (1989. Cell 58:537-S44>. Dimenzauon 
domains mav also be selected from other proteins, such as the reunoic acid receptor, tne 
thvroid hormone receptor or other nuclear hormone receptors (Kurokawa et ai U99.M 
Genes Dev . 7:1423-1435 1 or from the yeast transcnption factors GAL 4 and HAP1 
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.Marmonstein et al. (1992, Nature 356:408-414; Zhang et al. (1993 , Proc. Natl. Acad. Sc, 
USA 90 2851-2855 ; Dimerizauon domains are further aescnbed in U.S. Patent No. 
5.624.818 by Eisenraan. 

In another embodiment, the ohgomenzat.on domain is a tetramenzat.on domain. 
For example four activation units can be l.nked to a single DNA binding domain or a 
Ucand binding domain by covalently linking the activation units to a tetramenzat.on 
domain In a preferred embodiment, the tetramerization domain is the E. coh lactose 
repressor tetramerization domain ( am.no acids 46-360. Chakenan et al. (1991) J. B.ol 
Chem -66-1371; Albert: et al. .1993, EMBO J. 12:3227; and Lewis et al. (1996) Nature 
-71 P47,, as described in the Examples. Tnus. the inclusion of a tetramerization domain 
m a transcriptional activator allows four activation domains to be complexed together and 
fo-in a transcriptional activator complex. Furthermore, more than one activation unit car. 
be l.nked to one tetramenzat.on domain, to thereby form a transcriptional activator 
complex comprising more than 4 activation units. 

In another embodiment, the tetramenzation domain is that from a p53 protein. The 
P <- tetramerization domain maps to residues 322-355 of p53 (Wang et al. ,1994) Mol. 
Cell. Biol. 14:5182. Clore et a!. (1994) Sc.ence 265:386) and is further described in L'.S 
Pat. No. 5.573.925 b> Haiazonetis. 

Tne invention also provides for transcriptional activators containing at least one 
modified oliEomenzation domain Modifications in the oligomer.zation domain may 
,n-rease the stabihtv of tetramer formation, for example, substimtions that stabilize 
oHaomerizauon driven by leucine uppers are know. (Krylov et al. (1994) c.ted above: 
O'Shea et al. (10921 cited above .. As an exemplary modification of this type, residues 174 
or P5 of human p53 are substituted by glutamine or leucine, respectively, in a P 53 
25 chimeric protein of this invention. 

In other embodiments, the oligomerization domain can be ar. altered p."? 
tetramenzation domain which is incapable of forming hetero-tetramers with p53 proteins 
that have a wild-type p53 tetramenzation domain, such as wild-type P 53 or tumor-derived 
F 53 mutants. Such altered p53 tetramerization domains are further descnbed m U S Pa: 
30 No. 5.573.925 by Haiazonetis. 

These altered p53 tetramenzat.on domains are characterized by disruption of the 
native p<" tetramenzat.on domain and insertion of a heterologous oligomerization 
doma.n'in a wav that preserves tetramerization According to this invention, a disruption 
of -he p<3 tetramerization domain, involving residues 335-348 or a subset of these 
,« residues sufficient I v disrupts the function of this domain so that it can no longer drive 
tetramerization with wild-type p53 or tumor-derived ?53 mutants. At the same time. 
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however, introduction of a heterologous dimerizauon domain reestablishes the ability to 
form tetramers. which is mediated both by the heterologous dimenzation domain and by 
the residuai tetramerization domain of p53. 

Other exemplar.- suitable tetramerization domains include artificial 
letramerizaiion domains, such as variants of the GCN4 leucine zipper that form tetramers 
(Alberti et ai. (1993 , EMBO J. 12:3227-3236: Harbur>' et al. (1993 ) Science 262:1401- 
1407; Krylov et al. (1994) C994) EMBO J. 13:2849-2861). One of skill in the an could 
readilv select alternate tetramerization domains. For example, the tetramenc variant of 
GCN4 leucine zipper described in Harbury et al (19931. supra, has isoleucines at positions 
d of the co.led co.'i and leucines at positions a. in contrast to the original zipper which has 
leucines and valines, respectively. 

The GCN- leucine zipper drives parallel subunit assembly [Harbury et al. (1993). 
cited above], while the native p53 tetramerization doma.n drives anuparalle! assemble 
[Clore et al. (1994) cued above: Sakamoto et al. (1994) Proc. Natl. Acad. Sci. USA 
91:8974-8978]. Thus, various conformations of activation unit complexes can be obtained 
bv choosing various tetramerization domains. 

In addition, the an also provides a variety of techniques for identifying other 
narurallv occurring oligomenzation domains, as well as oligomenzation domains derived 
from mutant or otr.env.se artificial sequences. See. for example, Zeng et al. (1997) Gene 
1 85:245: O'Shea et al. ( 1 9921 Cell 68:699-708: Krylov et al. [cited above]. 

The distance between the oligomenzat.on domain and other components of the 
fusion proteins can be varied. In one embodiment, there .s no linker between an activation 
un.t and a tetramerization doma.n. e.g. an altered GCN4 leucine zipper In other 
embodiments however, there are glutam.c acid or asparagme or isoleuc.ne linkers. 
< respectively. L.nkers ma> oe present for cloning convenience or to confer some useful 
property. For example, residues that stabilize specific secondary structure elements, such 
as alpha -helices, are known (Richardson et al (1988) Science 240:1648-1652]. Such 
res.dues can be introduced in the l.nkers to stabilize me oligomenzation domains For 
examole the linkers glycine-asparagine. arginine-glycine- asparagine. arginme-glycine- 
c givctne-asparagine-proi.ne-glutamic acid, glycine-glycine- asparagme-glutamme-alanine. 
are all designed to stabilize the N-termmus of the alpha -helical oligomenzation domain. 

In one embodiment, the chimeric protein composes an activation unit fused to an 
asoaragine linxer and then to a tetramenc variant of GCN4 res.dues 249-28 1. 
Alternatively, the linxer can be an arginine-glycme-asparagme linker, an arginine-glycine- 
is glycine-asparagme- proline-glutamic acid linker, a giycine-glycine-asparagme-glutamine- 
alar.ine linker. 
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A varietv of other amino acid or peptide linkers may be used for the reasons 
discussed above, provided they do not interfere with the function of the activation units 
and ligand binding domain or DNA binding domain. 
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D. Ligand binding domain 

In another embodiment of the invention, components of the subject system include 
one or more ligand binding domains for mediating oligomerization of fusion proteins in a 
hgand-dependent fashion, in a preferred embodiment, the ligand is capable of interacting 
with wo ligand binding domains. In an exemplary embodiment, formauon of 
transcriptional complexes is regulated by addition of a ligand. and comprises introducing 
into the cell a fusion protein including a composite activator and a ligand binding domain, 
as well as a second fusion protein including a DNA binding domain and a ligand binding 
domain, such that m the presence of the ligand. a transcriptional activator complex is 
formed between we two fusion proteins. Preferred ligands include macrolides such as 
rapamvcin, cyclosporin A. FK506. FK1012. and analogs thereof, and other synthetic 
dimenzers or ohgomenzers. Ligand binding domains include the FK506 binding domain 
of FKBP. the cyciosponn-binding domain of calcineurin. and the rapamycin-binding 
domain of FRAP These binding domains and ligands are further disclosed, e.g.. in 
PCT/US93/0161? Such fus.on proteins permit control of the expression of a target gene to 
be dependent on addition of an appropriate ligand. e.g.. one which is capable of interacting 
simultaneously with the two iigand binding domains. 

In general, the ligand binding doma.n of a chimeric protein of this invention can be 
anv convenient domain which will allow for Hgand-dependent oligomerization of fusion 
proteins using a natural or unnatural ligand. preferably an unnatural synthetic ligand. Ot 
particular interest are binding proteins for which ligands (preferably small organic ligands.. 
are known or may be readily produced. These receptors or ligand binding domains induce 
the FKBPs and cvclophilin receptors, the stenod receptors, the tetracycline receptor, the 
other receptors indicated above, and the like, as well as "unnatural" receptors, which can 
be obtained from ar.t.bod.es. particularly the heavy or l.ght chain subuml. mutated 
sequences thereof, random ammo acid sequences obtained by stochastic procedures, 
combinatorial syntheses, and the like. 

For the most pan. the receptor domains will be at least about 50 amino acids, and 
fewer than about 3 50 ammo acids, usually fewer than 200 amino acids, either as tne 
natural doma.n or truncated active portion tnereof. Preferably the binding domain will be 
small (<25 kDa. to allow efficient transfection in viral vectors), monomenc (this rules out 
the avidin-biot.n system., nonimmunogenic. and should have synthetically accessible, cell 
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permeable, nontoxic ligands thai can be configured for dimerization. 

Mukimerizine iieands useful in practicing this invention are multivalent, i.e.. 
capable of bindine to. and thus muitimenzing. two or more of chimeric protein moiecules 
having a lieand bindine domain. Tne multimenzing Ugand may bind to the proteins 
< containing such lieand-b.nding domains, in either order or simultaneously, preferably w«h 
a Kd value below about lO* more preferably below about 10-'. even more preferably 
below about 10-8. ^ in some embodiments below about 10"' M. The hgand preferably is 
not a protein or polypeptide and has a molecular weight of less than about > kDa. 
nreferablv below 2 kDa The ligand-binding domains of the chimeric prote.ns so 
■ 0 multimerized mav be the same or different. Ligand binding domains include among 
others, various irr.munophihn doma.ns. One example is the FKBP domain which is 
capable of binding to dimerizmg ligands incorporating FK506 moiet.es or other FRBP- 
binding moieties See e.g. PCT/US93/01617, the full contents of which are hereby 
incorporated by reference 
! s The port.on of the construct encoding the ligand binding domain can be subjected 

to mutagenesis for a variety of reasons. Tne mutagenized doma.n car. provide for higher 
bindine affinity, allow for discrimination by a hgand between the mutant ana naturally 
occurring forms of the ligand binding domain, provide opportumt.es to design a hgand- 
hgand binding aoma.n pairs, or the like. The change in the ligand binding domain can 
, 0 involve changes in ammo acds known to be at the binding sue. random mutagenesis us.ng 
combinatorial techmoues. where the codons for the ammo acids associated w.th the 
binding site or other ammo ac.ds assoc.ated with conformational changes can oe suoject to 
mutagenesis bv changing tne coaon(s) for the particular ammo acid, either with known 
changes or random! v. expressing the resulting proteins in an appropriate erokaryotic host 
^ and then screening the resulting prote.ns for binding Illustrative of this Situation ,s to 
modify FKBP12 s Phe3o to Ala and or As P 37 to Gly or Ala to accommodate a suost.tuen; 
at positions 9 or 10 of FK506 or FK520. In particular, mutant FKBP 12 mo.et.es which 
contain Val. Ala. Gly. Met or other small ammo acids in place of one or more of Tyr26. 
Phe36. Asp37. TyrS2 and Phe9<J are of particular interest as receptor domains for FK506- 
30 type and FK-520-type hgands containing modifications at and/or C!0. 

Illustrative examples of rapamycin-bmding domains are those which include an 
approximately 89-amino acid rapamycm-binding doma.n from FRAP, e.g.. containing 
residues 2025-2'. 13 of human FRAP. Similar consicerat.ons apply to the generation of 
mutant FR.AP-der.ved domains which bind preferentially to rapamycm analogs (ranalogs. 
containins modifications (i.e.. are 'bumped') relative to rapamycm in the FRAP-b.nding 
effector domain. For example, one may obtain preferential binding using rapaiogs bearing 
.ubstituents ether than -OMe at the C7 position w.th FRBs based on the human FRAP 
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FRB peptide seauer.ee but bearing am.no acid substitutions for one of more of the res.dues 
Tvr-038 Phe^039 Thr2098. Gln2099, Trp2101 and Asp2l02. Exemplary mutations 
ln 'ciude Y2038H. Y2038L. Y2038V. Y2038A. F2039H. F2039L. F2039A. F2039Y. 
D' , 102A T2098A. T2098N. andT2098S. Rapalogs bearing substituents otner tnan -OH at 
C : 8 and'or substituents other than =0 at C30 may be used to obtain preferential binding to 
FRAP proteins bearing an ammo acid substitution for Glu2032. Exemplar, mutations 
mclude E2032A and E2032S Protetns comprising an FRB containing one or more amino 
acid replacements at the foregoing positions, libraries of proteins or peptides randomized 
those pos.t.ons .i.e.. containing various substituted am.no acids at those residues). 
hbrar.es randomizing the entire protein domain, or combinations of these sets of mutants 
are made us.ng the procedures described above to identify mutant FRAPs that b.nc 
preferentially to bumped rapaiogs. 

Othe- macrol.de b.nd.ne domains useful in the present invention, including 
mutants thereof, are described in the art. See. for example. VVQ96-41 865. W096 13613. 
WO96/06111, WO96/06110. WO96/06097. W096/12796, WO95/05389. WO95 0268-. 
\V094'18317, each of which is expressiy incorporated by reference herem. 

Tne abilitv to emplov in v,tro mutagenesis or combinatorial modifications of 
sequences encoding proteins allows for the production of libraries of prote.ns which can be 
screened for b.nd.ng affinity for different ligands. For example, one can totally randomize 
a sequence of 1 to 5. 1 0 or more codons. at one or more sites in a DNA sequence encoding 
a binding protein, make an expression construct and introduce the expression construct 
•nto a unicellular microorcanism. and develop a library. One can then screen the libran 
for binding aff.n.tv to one or desirably a plurality of i.gands. The best affinity sequences 
which are compatible with tne ceils into which they would be introduced can men oe usee 
; as the I.gand bind.ng doma.n. Tne hgand would be screened with the host eel's to be usee 
to determine the ievel of binding of the I.gand to endogenous proteins. A binding prot.le 
could be defined weighting the ratio of binding affinity to the mutagenic binding 
domain with the binding affinity to endogenous proteins. Those iigands which have tne 
best binding profile could then be used as the iigand. Phage display techniques, as a non- 
0 limiting example, can be used in carrying out the foregoing 

In other embodiments, antibody subunits. e.g. heavy or light chain, particularly 
fragments, more rarfcularly all or pan of the variable region, or fus.ons of heavy and light 
cham to create single chain antibodies, can be used as the ligand bmamg doma.n. 
.Antibodies can be prepared against haptenic molecules which are physioiog,call> 
.5 acceptable and the individual anabody subunits screened for binding affinity. 1 he cDNA 
encoding the subunits can be isolated and modified by deletion of the constant region, 
ronions^f the variable region, mutagenesis of the var.able region, or the like, to obtain a 
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bintfnt orowin domain that has the appropna.e affinity tor the hgand. In this way. almost 
anv phvsiolo 8 icallv acceptable hapten.c compound can be employed as the hgand or to 
^ovide an epitope for the l.gand. Instead of antibody units, natural receptors can be 
employed, where the binding domain is known and there is a useful hgand for bino.ng. 

In vet another embodiment of the invention, the DNA binding unit is linked to 
m ore than one iieand binding domain. For example, a DNA b.nd.ng doma.n can be linked 
toatl-ast- 3 4 or 5 liaand binding domains. A DNA binding domain can also be linked 
to at least < hgand bindine doma.ns or any number of hgand binding domains. In such 
embodiments, the hgand b,nd,ng domains can be. by illustration, linked to each other in a 
v, linear arrav bv iinktne the NH2-terminus of one l.gand binding domain to the COOH- 
termmus of another hgand binding domain, e.g.. as shown in Figure 3. Thus, numerous 
composite activators can be Hnked to a single DNA binding domain in the presence of a 
hgand. 

The invenf.cn further nrovides additional induction systems. In one embodiment. 
,< the invention uses an alternative alloster.c on-switch for transcription which employs a 
d-letion mutant of the human progesterone receptor, i.e.. which no longer binds 
progesterone or ar.v known endogenous steroid but can be activated by the orally active 
progesterone antacoras, RU486. described, e.g, in Wang et ah (1994) Proc. Natl. Acad 
Sci USA 91 8180 Activation was demonstrated, e.g, in cells transplanted into mice 

- usine doses of RU486 '5-50 ug-'kg) considerably below the usual dose for inducing 
abortion in humans , 1 0 mgltg , However, according to the art describing this sy stem, tne 
induction ratio in culture and in an.mals was rather low. Applying the invention 
described herein :n this system would P rov,de an inducible system having a higner 
induction ratio. Thus, the invention provides a transcriptional activator comprising at least 

- one subunit which is covaiently linked to a mutant steroid binding doma.n to y.elc a 
transcriptional actuator which reactivates in a Rl'486-dependen, manner, resulting m 
high induction ratios 

Tne invention can be adaoted to an ecdysone inducible system Early work 
demonstrated that fusing the Drosophila steroid ecdysone (Ec receptor (EcR) Ec binding 

10 domain to heteroiocous DNA binding and activation doma.ns. such as E col: iexA anc 
herpesvirus VPlfc perm.ts ecdysone-dependent activation of target genes downstream of 
appropriate bindin, sites (Chr.stopherson et al. (1992) Proc. Natl. Acad. Sc. U.S.A. 
89 6>14^ An improved ecdysone regulation system has been developed, using the DNA 
•~ ir c-ng domain of the EcR itself. In this system, the regulating transcription tactor is 

3, provided as two orotems: ilia truncated, mutant EcR fused to herpes VP16 and (2. tne 
'mammalian homoioc I'RXRt of Ultrasp.racie protein tUSP). which heterod.menzes w.tn 
the *cR (No et al ,1996, Proc. Natl. Acad. Sc.. U.S.A. «3:3346r In this system, because 
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the DNA binding domain was also recognized by a human receptor (the human famesoic 
X receptor), it was altered to a site recognized only by the mutant EcR. Thus, the 
Invention provides an ecdysone inducible system, in which a truncated mutant EcR is 
fused to at least one subun.t of. a transcriptional activator of the invention. The 
« transcriptional activator further comprises USP. thereby providing high level mducuon ot 
transcription of a target gene having the EcR target sequence, dependent on the presence 
of ecdysone. 

The invention can also be applied to any other inducible system, thereby providing 
an inducible system having a higher ratio of background to induction due to the potent 
10 transcriptional activity of the transcriptional activators of the invention. 
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E. Additional domains and linkers 

Additional domains may be included in the transcriptional activators of this 
invention. For example, the transcriptional activators may contain a nuclear localization 
sequence which provides for the protein to be translocated to the nucleus. Accordingly. ,n 
one embodiment of the invention, at least one of the subunits of the transcriptional 
activator of the invention, e.g.. activation unit. DNA binding domain, or hgand mnaing 
domain, further comprises a nuclear localization signal (NLS). A NLS can be fused to tne 
N-terminus. or tne C-terminus of a subumt, e.g.. an activation unit, or can be inserted at 
the junction of one subun.t with another subumt. e.g., between an activation domain and a 
DNA binding or hgand binding domain or ohgomenzation domain or elsewhere in the 
protein, as lone as the function of the subunits is not disrupted by insertion of the NLS. 
Typicaliv a nuclear localization sequence has a plurality of bas.c ammo acids, referred to 
as a bipartite bas.c reseat (reviewed in Garcia-Bustos et a!. (1991) B.ochimica et 
Biophvsica Acta \0'\ .83-101 .. The NLS may be that of SV40 large T antigen which is 
comprised of amino ac.ds proline-lysine-iysme-iysine-argimne-lysine-valine (Kalderon et 
ai (19841 Cell 39:4OQ-509V The NLS may also be from a p53 protein. Wild-type p;3 
contains three nuclear localization signals (NLS). all of which map to the C-term.nus of 
wild-tvt* P 53 and specifically to residues 316-325. 369-375 and 3^-384 of p;? 
(Shauiskv et al. (IOQOi Mcl. Cell. Biol. 10:6565-6577. Additional heterologous NLS are 
described by Shauiskv et ai i 1990. supra and Shaulsky et al.(199n Oncogene 6:2056. 

The chimeric prote.r.s may include domains that facilitate their purification, e.g. 
-histidine tacs" or a glutathione-S-transferase domain. They may include -epitope tags- 
encoding peptides recognized by known monoclonal antibod.es for the detection o: 
proteins within cells or the capture of proteins by andbodies in vitro 

It may be necessary ,n some instances to introduce an unstructured polypeptide 
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linker reg.on between an activation lag or tags and other port.ons of the ch.mer.c activator. 
Where the fusion proton also includes, for example, oligomerization sequences, .t may be 
preferable to situate tne linker between the oligomerization sequences and the activation 
tags The linker can facilitate enhanced flexibility of the fusion protein, wn.le the 
< oligomerization sequences are relatively free to make other inter-protein contacts, e.g.. 
with other chimeric activators. The linker can also reduce stenc hindrance between any 
t wo fragments of the fusion prote.n. The linker can also facilitate the appropriate foldmg 
of each fragment to occur The linker can be of natural ongm. such as a sequence 
determined to exist in random coil between two domains of a protein An exemplary 
0 -inker seauence is the linker found between the C-termmal and N-termma) domains of the 
RNA polymerase a subunit. Other examples of naturally occurring linkers include linkers 
found ir 'the lcl and Lex A proteins. Alternatively, the linker can be of synthetic origin. 
For instance, the seauence lG ly4Scr)3 can be used as a synthetic unstructured lmker 
Linkers of this type are described in Huston et al. (1988) PNAS 85:4879: and U.S. Patent 
15 No. 5.091.513- both incorporated by reference herein. 

t n some embodiments it is preferable that the design of a linker involve an 
ar^aneement of domains which requires the linker to span a relatively short distance, 
preferablv less than about 10 A. However, in certain embodiments, depending, e.g., upon 
the selected DNA-binding domains and the configuration, the linker may span a distance 
20 of up to about 50 A. 

Within the linker, the amino acid sequence may be varied based on the preferred 
charactenstics of the hnxe: as determined empirically or as revealed by modeling For 
instance in addition to a des.red length, modeling studies may show that side groups of 
certain amino aods mav interfere with the biological activity, e.g. DNA D.nding or 
^ transcriptional activation, of the protein. Considerations in choosing a linxer include 
flexibility of the linker, charge of the linker, and presence of some amino aods of the 
linker in the natural lv-occumn g subun.ts. The linker can also be designed such tnat 
residues in the linker contact DNA. thereby influencing binding affinity or specificity, or 
to interact w.th other protons. For example, a linker may contain an amino aod sequence 
,0 which can be recognized by a protease so that the activity of the chimeric proton could be 
reflated bv cleavaee. In some cases, particularly when it is necessary to soar, a longer 
distance between subun.ts or when the domains must be held in a particular configuration, 
the linker may optionally contain an additional foided domain. 

Most of the sub.ect fus.on proteins can be tested for activity in vivo us.ng a simple 
3, assav (F M Ausube! et al Eds Current Protocols in Molecular Biology. John W.ley & 
Sons New York 1994. de Wet et al. (19871 Mol. Cell Biol. 7:725). The in vivo assay 
^eau<res an expression construct containing and capable of directing the expression ol a 
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recombinant DNA sequence encoding the compos.te activator, and as appropr.aie. other 
proteins reqmred for DNA localization of the activator. The assay also requires a plasma 
containing a reporter gene , e.g.. the iuciferase gene, the chloramphenicol acetyl 
transferase (CAT) gene, secreted alkaline phosphatase or the human growth normone 
(hGH) gene, linked to a binding she for the transcription factor. The expression constructs 
are introduced into host cells which normally do not produce interfering levels of tne 
reporter eene product. A second group of cells, which lacks the composite activator or the 
means for localizing the activator to the reporter gene can serve as the control. 

The production of mRNA or protein encoded by the reporter gene is measured. An 
increase m reporter eene expression not seen in the controls indicates that the transcnpt.on 
factor is a positive regulator of transcription. If reporter gene expression is less than that of 
the control, the transcnpt.on factor is a negative regulator of transcription. 

Optional*, tne assay may include a transfection efficiency control plasmid. 'This 
piasm.d expresses a gene product independent of the test gene, and the amount of this gene 
« product indicates rouchlv how many cells are taking up the plasmids and how efficiently 
the DNA is being introduced into the cells. Additional guidance on evaluating ch.mer.c 
proteins of this invention is provided below. 
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111. Nucleic Acid Compositions 

In another aspect of the invention, the proteins described herein are provided m 
expression vectors For instance, expression vectors are contemplated which include a 
nucleotide sequence encoding a polypeptide containing a composite activator of tne 
present invention, which cocinc sequence is operabiy linked to a. least one transcnpuonai 
regulator sequence. Regulatory sequences for directing expression of the mstant tusion 
proteins are art-recocmzed and are selected b> a number of well understood criteria. 
Exemnlarv reeuiatorv sequences are described in Goeddel; Gene Expression Technology: 
Methods in Enzvnology. Academic Press. San D.ego. CA (1990). For instance, am o, a 
wide varietv of expression control sequences that control the expression ot a DNA 
seauence when operative* linked to it may be used in these vectors to express DNA 
sequences encoding tne fusion prote.ns of this invention. Such useful expression control 
seauences. include, for example, the early and late promoters of SV40. adenovirus o: 
cvtomeealovtrus immediate early promoter, the lac system, the trp system, the 1 AC o: 
TRC svstem. T7 promoter whose expression is directed by T7 RNA polymerase, tne 
promoter for 3-phospho C ivcerate kinase or other glycolytic enzymes, the promoters ot acid 
phosphatase, e.g., Pho5, and the promoters of the yeast a-maung factors anc other 
.cauences known to control the expression of genes of prokaryotic or eukaryotic cells or 
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t-er viruses and various combinations thereof. It should be understood that the design of 
the expression vector may depend on such factors as the choice of the host cell to be 
•reformed. Moreover, .he vector s copy number, the ability to control that copy number 
and the expression of any other protein encoded by the vector, such as anub.ouc markers. 
< should also be considered. 

As »ill be apparent, the subject gene constructs car. be used to cause expression of 
the subject fusion proteins in cells propagated in culture, e.g. to produce nrote.ns or 
polypeptides, including fusion proteins, for punficanon. 

This invention also pertains to a host cell transfected with a recombinant gene in 
i0 order to express one of the subject polypeptides. The host cell may be any prokarvotic or 
eukarvotic cell. Fc: examp'.e. a fus.on proteins of the present invention may be expressed 
in bacterial cells such as E. co',. insect cells (baculo virus), yeast, or mammalian cells 
Other suitable host cells are known to those skilled in the art. 

According , the present invention further pertains to methods of producing the 
■ S subject fusion proteins. For example, a host cell transfected with an express.cn vector 
encoding a protein of .merest car. be cultured under appropriate conditions to allow 
express.^ of the rrotem to occur. The protein may be secreted, by inclusion of a 
secretion sicnal sequence, and isolated from a mixture of cells and medium containing the 
protein. Ahernauveiv. the protein may be retained cytoplasm.caHy and the cells harvested. 
^0 ivsed and the protein isolated. A cell culture includes host cells, media and other 
byproducts. Suitable media for cell culture are well known in the art. The proteins can be 
isolated from cell culture medium, host ceils, or both us.ng techniques known in the art for 
Pur.fv.ng prote.ns. including .on-exchange chromatography, gel filtration chromatography, 
ultrafiltration, electrophoresis, and immunoaffimty purification with antibodies spec.nc lor 
25 particular epitopes of the protein 

Thus a codine sequence for a fusion protein of the present invention can be used to 
produce a recombinant form, of the prote.n via microbial or eukarvotic celluiar processes 
L, 2 at,ne the noivnucleot.de sequence into a gene construct, such as an expression vector, 
and transforming or transfectmg into hosts, either eukarvotic lyeast. avian., insect or 
30 mammalian, or prokaryotic . bacterial ceils ;, are standard procedures. 

Expression vehicles for production of a recombinant protein include pernios and 
other vectors. For instance, suitable vectors for the expression of the ir.sumt fusion 
proteins induce p.asm.ds of the types: pBR322-denved p.asm.ds. PEMBu-dcnvec 
plasm.es. P EX-denved plasm.ds. pBTac-denved plasm.ds and pL'C-derived olasmids tor 
3 5 expression in prokaryotic cells, such as E. coh. 

A number of vectors exist for the expression of recombinant prote.n, m yeast. For 
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instance YEP24. YIPS, YEP51, YEP52. pYES2. and YRP1? are clon.ng and expression 
vehicles useful in the introduction of geneuc constructs into S cerevisiae (see. tor 
example Broach et ai.. (1983) in Experimental+Man.pulat.on+of+Gene+Express.on. ed. 
M Inouve Academic Press, p. 83. incorporated by reference herein,. These vectors can 
replicate in E. coli cue the presence of the pBR322 on. and in S cerevisiae due to me 
replication determinant of the yeast 2 m.cron plasmid. In addition, drug resistance 
markers such as ampicillin can be used. 

The preferred mammalian expressmn vectors contain both prokaryotic sequences 
to facilitate the propagation of the vector in bacteria, and one or more eukaryot.c 
, transcnption units mat are expressed in eukaryot.c cells. The pcDNALamp. pcDNAI/nec. 
PR-/CMV P SV2cpt. pSV2neo. pSVZ-dhfr. P Tk2. pRSYneo. pMSG. pSYTT. P ko-neo and 
pHva derived vectors are examples of mammalian expression vectors suitable tor 
transaction of eukarvonc cells Some of these vectors are modified with sequences from 
bacterial piasmids. such as P BR322. to facilitate replicat.on and drug resistance selection 
< in both prokarvotic and eukaryotic cells. Alternatively, derivatives of viruses sucn as me 
bovine papilloma virus (BPV-1 ,. or Epstein-Bart virus (pHEBo. pREP-denved and P 20,! 
can be used for transient expression of proteins in eukaryot.c cells. Examples of otner 
viral (including retroviral , expression systems can be found below in the description of 
gene therapv del.verv systems The various methods employed in the preparation of me 
•0 piasmids and transformation of host organisms are well known m the an. For other 
suitable expression svstems for both prokaryotic and eukarvonc cells, as well as general 
recombinant procedures, see Molecular Clomng:tA Laboratory+Manual. 2nd Ea.. ed. r> 
Sambrook. Fritsch and Maniatis (Cold S P nng Harbor Laboratory Press. 1989) Chapters 16 
and r In some instances, it mav be desirable to express the recombinant fusion proteins 
- bv the use of a baculovirus expression system. Examples of such baculovirus express.cn 
sterns include F YL-denved vectors (such as pVL1392. pVL1393 and pVL941 ,. 
pAcUW-derived vectors (sucn as pAcUWl ). and pBlueBac-denved vectors (sucn as me _- 
gal containing pBiueBac 111). 

In vet other embodiments, the subject expression constructs are derived r> 
,C insertion of the sub.ec. gene into viral vectors including recombinant retroviruses, 
adenovirus, adeno-associated vims, and herpes simplex v,rus-l. or recombinant oacter.a, 
or eukarvonc piasmids. As described in greater detail below, such embodiments ot me 
subject expression construes are specifically contemplated for use in various m v.vo an, 
ex vivo gene therapy protocols. 

Retrovirus vectors and adeno-assoc.ated virus vectors are generally understood to 
be the recombinant cene delivery system ofcho.ee for the transfer of exogenous genes :n 
v.vo oarticulariv into humans. These vectors prov.de efficient delivery ot genes into 
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cells. and the transferred nucleic acids are stably integrated into the chromosomal DNA of 
the host. A maior prerequtsite for the use of retroviruses is to ensure the safety of their 
use, particularly with regard to the possibility of the spread of wild-type v.rus in the cell 
population The development of specialized cell lines (termed "packaging cells' ) which 
produce only replication-defective retrov t ruses has increased the utility of retroviruses for 
gene therapv. ana defective retroviruses are well characterized for use in gene transfer for 
gene therapv purposes (for a review see Miller. A.D. (1990) Blood 76:271). Thus, 
recombinant retrovirus can be constructed in which part of the retroviral coding sequence 
(gag, pol. env) has been replaced by nucleic acid encoding a fusion protein of the present 
invention, e.g.. a composite activator, rendering the retrovirus replication defective. The 
replication defective retrovirus is then packaged into virions which can be used to infect a 
target eel! through the use of a helper virus by standard techniques. Protocols for 
producing recombinant retroviruses and for infecting cells in vitro or in vivo with such 
viruses can be found in Current Protocols in Molecular Biology. Ausubel, F.M. et al.. 
feds.) Greene Publishing .Associates. (1989). Sections 9.10-9.14 and other standard 
laboratory manuals. Examples of suitable retroviruses include pLJ, pZIP, pWE and pEM 
which are well known to those skilled in the art. Examples of suitable packaging virus 
lines for preparing botn ecotropic and amphotropic retroviral systems include SYMBOL 
121 \f "SymborCnp. SYMBOL 12! \f "Symbol"Cre. SYMBOL 121 \f "Symbols and 
SYMBOL 121 \f "Symbol-Am Retroviruses have been used to introduce a variety of 
genes into many different cell types, including neural cells, epithelial cells, endothelial 
cells, lvmphocv.es. myoblasts, hepatocytes. bone marrow cells, in vitro and'or m vivo (see 
for example Eelius et al.. (1985) Science 230:1395-1398: Danos and Mulligan. (1988) 
PNAS USA 85:6460-0464: Wilson et al.. (1988) PNAS USA 85:3014-3018: Armentano et 
al.. (T990) PNAS USA 87:6141-6145: Huber et al.. (1991) PNAS USA 88:8039-8043. 
Ferry et al.. (1991) PNAS USA 88:8377-8381: Chowdhury et al.. ( Science 
^4-1802-1805 van Beusechem et al., (1992> PNAS USA 89:7640-764-:. Kay et al.. 
(1992) Human Gene Therapy 3:641-647; Da, et al.. (1992) PNAS USA 89:10892-10895: 
Hwu et al.. (19931 J. Immunol. 150:4104-4115; U.S. Patent No. 4.868.116: U.S. Patent 
No. 4.980.286; PUT Application WO 89/07136; PCT Application WO 80/02468: PCT 
Application WO 89 05345: and PCT Application WO 92'07573). 

Furthermore, it has been shown that it is possible to limit the infection spectrum of 
retroviruses and consequently of retrovirai-based vectors, by modifying tne viral 
packaging prote-.r.s on the surface of tne viral panicle (see. for example PCT publications 
W093 -25234. WOO4 '06920, and W094/U 524). For instance, strategies for the 
modification of the infection spectrum of retroviral vectors include: coupling antibodies 
specific for cell surface antigens to the viral env protein iRoux et al.. (1989) PNAS USA 
86:9070-9083: Julan et al.. (1992) J. Gen Virol 73:3251-3255: and Goud et al.. .1983) 



PCT/US97M5219 

WO 99/10508 



-4! 



v , ,v !63-M-^4r or coupl.ng cell surface Uganda to the vual env proteins (>,eaa 

!l t «' rn-rr ">66 1414^-14146 . Coupling can be m the form of the chem.cal 
al., (1991 . J. Biol. Cnem. 66.14 ^ ^ ^ m 

cross-Unking with a protein or Cher vanet> (e.g. lactose 5ingle . chain 

^11 a<; bv generauna fusion proteins ^e.g. 
asialoglycoprotein, as 1 » * J whlie useful to Hmn or otnenv.se direct the 
antibody-env fusion proteins... This ^.que, ^ . r ^ 

infection to certain tissue types, and can also be used 
an amphotropic vector 

i .. -rv , v ,. e rn useful in the present invention utilizes 
adenovirus-derivec vector; . Tne genom ^ „ f ,, s abll , y to repl ,c,« ,„ a 

encodes a gen. product 0, ,n.e.«s . be., .s m BtoTechntques 6:616: 

r rjsrj^^si^ « »- <>»» « 

T V ' o J served notn .he adenovtsus «* Ad type 5 d.324 « 

of ce„ .pes. tnCodtnP I^SU^XUm^ 
r' sTsA 89 s'-L Fststnennose. the vtnss pantele ,s re,a„v=K »bk add 

:5 situations where introduced DNA bee.orr.es inte t 

,l„ cacacitv of the adenoviral genome tor .oreign v 

Ahmand and Granam J > iroi. - ^let« J for ail or 

,4 • ^tnro fivored bv the present invention are deieu- ior 
vecl0 rs currently in use^c ^ ^ as 80n of the adenoviral gene.c 
30 parts of tne v.ral c an E. ge - el a ,. su?ra: and Graham et 

nnatenal (see. e.g.. .ones e ; .1 , C (H umana. Clifton. NJ. > vo, 

aK . in Methods in Molecular 3.olog>. E.-. Murra. . 

mo -x-ression of the inserted chimenc gene can be und_. co..tro 

e ^ p r; r 1., promote, the ma,r late promoter ,MLP, and associated leader 
, lances, the viral E3 promoter, or exogenous, added promoter sequences. 

Yet another v,al vector system useful for deliver,- of the subject chimeric genes s 
|AAV | ^deno-associated virus is a natural, occumn, 
the adeno-associatea virus (AA\ I Aaeno as 
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defective virus that reau-.res another virus, such as an adenovirus or a herpes virus, as a 
heloer virus for efficient replication and a productive life cycle. (For a review, see 
Muzvczka et al., Curr Topics in Micro, and Immunol. (1992) .58:97-129). It .s also one 
of the few viruses that may integrate its DNA into non-dividing cells, and exhibits a high 
frecuencv of stable in.egrauon ,see for example Hone et al.. (1992) Am. J. Res F ir. Cell 
Mo'l Biol "49-356; Samuisk. et al., (1989) J. Virol. 63:3822-3828: and McLaughlin et 
al ,1989) J Virol 62:1963-1973). Vectors containing as linle as 300 base pairs of AAV 
can be packaged and can integrate. Space for exogenous DNA is limited to about 4.5 kb. 
An .AAV vector such as that described in Tratschin et al.. 0985) Mol. Cell. Biol. 5:3251- 
r, 3^60 can be used to introduce DNA into cells. A variety of nucle.c acids have been 
introduced into different cell types using AAV vectors (see for example Hermonat et al.. 
(1984) PNAS USA 81:6466-6470: Tratschin et al., (1985) Mol. Cell. Biol. 4:20-2-2081. 
Wondisford et al.. (1988, Mol. Endocrinol. 2:32-39; Tratschin et al.. (1984) J. Virol. 
51:611-619. and Florte et al.. (1993) J. Biol. Chem. 268:3781-3790). 

Other viral vector systems that may have application in gene therapy have been 
derived from heroes virus, vaccinia virus, and several RNA viruses. In particular, herpes 
virus vectors mas rrovide a unique strategy for persistence of the recombinant gene sn 
cells of the central nervous system and ocular t.ssue (Pepose et al.. (l«X»4i invest 
Ophthalmol Vis Sci 35:2662-2666) 
, 0 In addition to %iral transfer methods, such as those illustrated above, non-viral 

methods can also be employed to cause expression of a protein in the tissue of an an.mal 
Most nonviral methods of gene transfer rely on normal mechanisms used by mammalian 
ceils for the uptake and intracellular transport of macromolecules. m preterred 
embodiments, nor, viral gene delivery systems of the present invention rely on enaocync 
pathways for the uptake of the gene by the targeted cell. Exemplary gene delivery systems 
of this type include liposomal derived systems, poly-lysine conjugates, and artiticiai vira; 
envelopes. 

In a representative embodiment, a gene encoding a composite activator can be 
entrapped in liposomes bearing positive charges on their surface (e.g.. Hpofectins. and 

,0 (oouonallv) which are tagged with antibodies against eel! surface antigens of tne target 
tissue (Mizuno et al.. ( 1 992) No Shinke, Geka 20:547-551 . PCT publication \\ 091 06309. 
Jaoanese patent application 1047381; and European patent publication EP-A-430O) For 
example. Hpofection of neuroglioma cells can he carried out using liposomes tagged witn 
monoclonal antibod.es against glioma-associated antigen (Mizuno et al.. (1992) Neurol. 

35 Med. Chir. 32:873-876). 

In vet another illustrative embodiment, the gene delivery system composes an 
antibodv or cell surface hgand which is cross-linked with a gene binding agent such as 
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poly-lysme .see. for example. PCT publications WO93'04701. W092/22635. 
W'092'20316. W092/19749. and WO92/061 80). For example, any of the subject gene 
constructs can be used to tiansfect specific cells in vivo using a solubie poiyr.ucieotide 
carrier comprising an antibody conjugated to a polycauon. e.g. poly-lysine (see U.S. Patent 
: 5.166.320). It will also be appreciated that effective deliver,- of the subject nucleic acid 
constructs via -mediated endocytosis can be improved using agents which enhance escape 
of the gene from the endosomal structures. For instance, whole adenovirus or rusogenic 
peptide's of the influenza HA gene product can be used as part of the delivery system to 
induce efficient disruption of DNA-containing endosomes (Mulligan et al.. (1993) Sc.ence 
260-926; Wagner e: al.. ( 1992) PNAS USA 89:^934; and Christiano et al.. (1993) PNAS 
USA 90:2122). 

In clinical settings, the gene delivery systems can be introduced into a patient by 
any of a number of methods, each of which :s familiar in the an. 

For instance, a pharmaceutical preparation of the gene deliver.' system can be 
introduced systemically. e.g. by intravenous injection, and specific transduction of me 
construct in the target cells occurs predominantly from specificity of transfection provided 
by the gene delivery vehicle, cell-type or tissue-type expression due to the transcriptional 
regulatory sequences controlling expression of the gene, or a combination thereof, in 
other embodiments, inmal delivery of the recombinant gene is more limited with 
introduction into the animal being quite localized. For example, the gene delivery vehicle 
can be introduced by catheter (see U.S. Paten: 5.328.470) or by stereotactic injection (e.g. 
Chen et al.. (1994) PNAS USA 91 : 3054-3057). 



IV. Target gene 

As used herein, the term "target gene" refers to a gene, whose transcription is 
stimulated according to the method of the invention. In a preferred embodiment, the gene 
is integrated in the chromosomal DN A of a cell. A cell comprising a targe: gene is 
referred to herein as a ""target cell" 

In a preferred embodiment of the invention, the target gene is ar. endogenous gene 
As used herein, the term "endogenous gene" refers to a gene which is naturalb present :n a 
cell, in its natural environment, i.e.. not a gene which has been introduced into the cell b> 
genetic engineering. The endogenous gene can be any gene having a promoter that <> 
recognized by at least one transcription factor. In a preferred embodiment, the promoter or 
any regulatory element thereof, of the endogenous gene ("endogenous promoter" and 
"endogenous regulatory element", respectively), is recognized by a known, preferabb 
cloned. DNA binding protein, whether it is a transcriptional activator or repressor 
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experimentation, such as screening P nNA binding domain to a target 

Jeet promote, Furthermore, the aftimrv of bmain ofaDNA Z 

sequence can be improved according to me* £ - —"domain ^ „ for 

comprise, e.g.. introducing mutat.ons into the DNA 

nutans having increased DNA binding affinity. 

„,, ch com„n S „ «*~« -«« ".^rrponio; of ,he endogenous 
proper. In preferred err.ocd.men.s. ,h= «og P 

capable of binding wu. h.g, a..m.rv protein can 

be irom any otner specie*, r 

„ lh - u^e, gene is . exogenous gene. .» . p-fcn- 
In vet another etrrooormer.t. the urge! ««™ ,„ m ,l DNA of a cell The 

— * r^ir^=t Zhoo, gene - 
exoeenous gene can be inserter imo ^ m jn a 

sm6le cop> or -;- tn " b ' Jen, » -la ,h„ one cop, 

can be used. 

,„ m e - rage, gene cons™, envies .ansa npoon o , . £ £ 

" to he reared ha . option facor ,n COMls „„ e of: 

DNA moiecole rvh.eh nrdudes , symheue ««c„puon ^^Lh.aff.n.u bv rhe DNA 
« „ py mul „P,e cop,e S of n ^ e ntetoe. „ of . pro.e.n 

-.0 w h,chrecro,,s,heaornpos,,= ac.,ra.on<-, P lrans c„p„on factor 

TATA co. »d stance » • ^UT^dae,. nrc.ud.ng fences d-.a, 

«~ -a. . ^r^l T^*- « ■»«*■«= <*• - — ; 

promote the imitation an- termination int:on DNA". and 

^ (n-) a sequence directing cleavage ai y . 

■ A of ,enes „ he ^ -he ~^£^£r 

intlQpnqe seauence or nbozvme ot interest i ne un_ ^ 
therapeutic protein, antisense *equcu 
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sequence of interest which provides a desired phenotype. It can encode a surface 
membrane protein, a secreted protein, a cytoplasmic protein, or there can be a plurality of 
target genes encoding different products. The target gene may be an antisense sequence 
which can modulate a particular pathway by inhibiung a transcriptional regulation protein 
5 or rum on a particular pathway by inhibiting the translation of an inhibitor of the pathway. 
The target gene can encode a ribozyme which may modulate a particular pathway by 
interfering, at the RNA level, with the expression of a relevant transcriptional regulator or 
with the expression of an inhibitor of a particular pathway. The proteins which are 
expressed, singly or in combination, can involve homing, cytotoxicity, proliferation, 
10 immune response, inflammatory response, clotting or dissolving of clots, hormonal 
regulation, etc. The proteins expressed may be naturally-occurring proteins, mutants ot 
naturallv-occurring proteins, unique sequences, or combinations thereof. 

Various secreted products include hormones, such as insulin, human growth 
hormone, glucaeon. pituitary releasing factor, ACTH. melanotropin, relaxin. etc., growth 
15 factors, such as EGF. IGF-1, TGF-a. -p, PDGF. G-CSF, M-CSF, GM-CSF, FGF. 
erythropoietin, thrombopoietin. megakaryocyte stimulating and growth factors, etc.. 
interieukins, such as IL-1 to -13; TNF-cc and -B. etc.; and enzymes and other factors, such 
as tissue plasminogen activator, members of the complement cascace, perforins, 
superoxide dismutase. coagulation factors. antiihrombin-III, Factor VIIlc. Factor VIIIvW. 
20 Factor IX. a-antitrypsm. protein C. protein S. endorphins, dynorphin. bone morphogenetic 
protein, etc. 

The gene can encode a naturally-occurring surface membrane protein or a protein 
made so by introduction of an appropriate signal peptide and transmembrane sequence. 
Various such proteins include homing receptors, e.g. L-selectin (Mel- 14). biood-related 
25 proteins, particularly having a kringie structure, e.g. Factor VIIlc. Factor VIIIvW, 
hematopoietic cell markers, e.g. CD3. CD4, CDS. B-cell receptor, TCR subunns a. p, y. c. 
CD10, CD19, CD28. CD33. CD38. CD41, etc., receptors, such as the interieukin receptors 
IL-2R. IL-4R. etc.. channel proteins for influx or efflux of ions, e.g. Ca+2. Na-. G- 
and the like; CFTR. tyrosine activation motif, zap-70, etc. 

30 Proteins may be modified for transport to a vesicle for exocytosis. Bn adding tne 

sequence from a protein which is directed to vesicles, where the sequence is modified 
proximal to one or the other terminus, or situated in an analogous position to the protein 
source, the modified protein will be directed to the Golgi apparatus for packaging m a 
vesicle. This process in conjunction with the presence of the chimeric proteins tor 

35 exocytosis allows for rapid transfer of the proteins to the extracellular medium and a 
relatively high localized concentration. 

Also, intracellular proteins can be of interest, such as proteins m metabolic 
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pathwavs reeulatorv prote.ns. stero.d receptors, transcnpuon factors, etc.. depending upon 
the nature of the host cell. Some of the proteins indicated above can also serve as 
intracellular proteins. 

Bv wav of further illustration, in T-cells. one may w.sh to introduce genes 
encoding one or bom chams of a T-cell receptor. For B-cells. one could prov.de the heavy 
and light chams for an immunoglobulin for secretion. For cutaneous cells, e.g. 
keratinocvtes. particularly stem cell keraunocytes. one could prov.de for protecuon agamst 
infection.' by secreting a-. B- or ^-interferon, antichemotact.c factors, proteases specific for 
bacterial cell wall proteins, etc. 
, In addition to providing for expression of a gene having thenapeutic value, there 

will be manv situations where one may wish to direct a cell to a particular sue. The Site 
can include anatom.cal sites, such as lymph nodes, mucosal t.ssue. skin, synovium, lung or 
other internal orcans or functional sues, such as clots, injured sites, sites of surgical 
manipulation, inflammation, infection, etc. By providing for expression 
5 membrane proteins which will direct the host cell to the particular site by providing for 
bind.ne at the host target site to a naturally-occurring epitope, localized concentrations of a 
secreted product can be achieved. Prote.ns of interest include homing receptors, e.g. L- 
selectin GMP140. CLAM-1. etc.. or addressms. e.g. ELAM-1. PNAd, LNAd, etc.. clot 
bindme proteins, or cell surface prote.ns that respond to localized gradients of chemotact.c 
:0 factors" There are numerous sanations where one would w.sh to d.rect cells to a particular 
sue. where release of a therapeutic product could be of great value. 

For use in gene therapy, the target gene can encode any gene product that is 
beneficial to a subject. The gene product can be a secreted protein, a membraneous 
protein or a cvtoolasm.c protein. Preferred secreted prote.ns include growth factors. 
- differentiation factors, cytokines. >nterieuk.ns. tPA. and erythropoietin. Prelerrec 
membraneous proteins include receptors, e.g. growth factor or cytokine receptors or 
prote.ns mediat.ng apoptos.s. e.g.. Fas receptor Other cand.date therapeutic genes axe 
disclosed inPCT-rS93/01617. 

In vet another embodiment, a -gene activation" construct which, by homologous 
30 recombination wth a genomic DNA. alters the transcriptional regulatory sequences or an 
enaoeenous gene, can be used to .ntroduce recogn.t.on e.ements for a DNA b.no.ng 
activity of one of the sub.ee: engineered proteins. A vareity of different formats for tne 
eene acnvat.on constructs are available. See. for example, the Transkaryotic Therap.es. Ir.c 
PCT publ.cat.ons WO93'09==. WO9511560. W09&WU. WO95/31560 and 
35 WO94/12650. 




PCT/US9"V1S219 

WO 99/10508 

.47 - 

V. Methods of the inv ention 

The invention provides methods for stimulating transcription of a target gene, in 
particular, an integrated gene. The gene can be an endogenous gene, an exogenous gene, 
or a combination of both. The method of the invention comprises introducing into a targe: 

5 ceil a multiplicity of transcriptional activation units linked to a DNA binding domain 
and/or a ligand binding domain. If the transcriptional activation units are linked to a 
licand binding domain, then the method further comprises introducing into the cell a DNA 
binding domain linked to at least one ligand binding domain and a ligand capable of 
interacting with tne ligand binding domains, such that the transcriptional activation units 

10 and DNA binding domain axe interacting, thereby stimulating transcription of a targe: 
gene. 

In another embodiment, the method of the invention comprises introducing into a 
target cell a multiplicity of ligand binding domains linked to a DNA binding domain 
together with at least one transcriptional activity unit linked to a ligand binding domain 
!5 and a ligand which is capable of simultaneously binding to the two ligand binding 
domains. Thus, exposure of a target cell with an appropriate ligand will result in 
formation of a transcriptional activator complex comprising a multiplicity of activation 
units. Furthermore, since, as shown herein, the level of transcription of a target gene is 
directly correlated to the number of activation units that can be delivered to the target 
20 gene, the level of transcription of a target gene can be controlled by the level of ligand 
used to treat the target cell. 

In a preferred embodiment, the method of the invention comprises introducing into 
a cell a target gene, a nucleic acid encoding a polypeptide having a multiplicity of 
activation units and a DNA binding domain or a ligand binding domain. If the polypeptide 
25 comprises a ligand binding domain, then the method further comprises introducing into the 
cell a nucleic acid encoding a polypeptide comprising a DNA binding domain and a ligand 
binding domain. The nucleic acids and vectors comprising such can be prepared as 
described above. These can be introduced into a cell composing a target gene, i.e., targe: 
cell, according to methods known in the art. In embodiments in which the targe: cell is in 
30 vitro, the nucleic acids can be introduced into the target cell by transfection involving 
electroporation, chemical transformation (e.g.. calcium chloride), liposomes, or virai 
transformation. In embodiments m which the target cell is in vivo, the nucleic acias car. 
be introduced into the cell by direct injection of naked DNA. liposomes, or any other 
method of administration of nucleic acids to a subject. 
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VI Kits 

This invention funher provides kits useful for the foregoing applicauons. One 
such kit contains one or more nucle>c acids encoding a transcriptional activator or subunits 
thereof. The kit mav further comprise an additional nucle.c acid containing a target gene 
linked to a DNA sequence to which the transcriptional activator is capab.e of binding. 
A'temativelv. the additional nucle.c acid may contain a cloning sue for insertion of a 
desired target gene bv the practitioner. For regulable applications, i.e.. in cases m wmch 
the recombinant protein contains a l.gand binding domain or inducible domain, tne k.t may 
farmer contain an oiieomerizing agent, such as the macrol.de dimerizers discussed above. 
Such kits may for example contain a sample of a dimenzing agent capable of aimenzmg 
the wo recombinant proteins and activating transcription of the target gene. 

VII. Exe m plary Uses 

The invention croviaes methods for stimulating transcription of a target gene at 
high levels, in particular of target genes integrated into chromosomal DNA. Strong 
transcriptional activators had not been prepared in the past. Since it was knov^ that potent 
transcriptional activators would induce squelching. However, as shown herein, 
transcription of an integrated gene can be stimulated at significantly higher levels than 
non-,ntegrated eenes due to the absence of squelch.ng on integrated genes. Th.s 
observation is important in the context of certain gene therapy applications because ,t 
suggests that hiehiv ootent transcriptional activators can be used to drive therapeutic gene 
expression to verv hich levels without general toxicity to the cell and that en.cacious 
levels of secreted therapeutic proteins may be attained with fewer engineered cells mar. 
previously thought. Furthermore, the invention provides a method for controlling the level 
of transcription, by controlling the amount of transcriptional activation units delivered to 
the target gene by use of varying amounts of the oligomenzer. 

In one embodiment, the invention is used to produce higher levels of a desired 
protein ex vivo. Production of recombinant therapeutic proteins for commercial and 
invesueational purroses is often achieved through the use of mammalian cell lines 
o engineered to express the prote.n at high level. Tne use of mammalian cells, rather man 
bacteria or veast. is indicated where the proper function of the prote.n recuires post- 
radiational modifications not generally performed by heterologous cells Examp.es ot 
proteins produced commercially this way include erythropoietin, tissue plasminogen 
activator, clotune factors such as Factor VIII x. ant.bod.es. etc. The cost o: producing 
, proteins in this fashion is directly related to the level of expression achieved m the 
engineered cells Tnus. because the invention described herein can achieve considerably 
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higher express.on levels than conventional expression systems, it may greatly reduce the 
cost of crotein production. A second limitation on the production of such proteins is 
toxicity to the host cell: Protein expression may prevent cells from growing to high 
density, sharply reducing production levels. Therefore, the ability to tightly control 
< protein expression, as described for regulated gene therapy, permits cells to be grown to 
high densitv in the absence of protein production. Only after an optimum cell density is 
reached, is" expression of the gene activated and the protein product subsequently 
harvested. 

A similar problem is encountered in the construction and use of '-packaging lines" 
10 for the production of recombinant viruses for commercial (e.g.. gene therapy) and 
experimental use. These cell lines are engineered to produce viral proteins required for the 
assembly of infectious viral particles harbor.ng defective recombinant genomes. Viral 
vectors that are dependent or. such packaging lines include retrovirus, adenovirus, and 
adeno-associated virus. In the laner case, the titer of the virus stock obtained from a 
!« packaging line is directly related to the level of production of the viral rep and core 
proteins. But these proteins are highly toxic to the host cells. Therefore, it has proven 
difficult to generate high-t.ter recombinant viruses. This invention provides a solution to 
this problem, by allowing the construction of packaging lines in which the rep and core 
genes are placed under the control of regulatable transcription factors of the design 
20 descnbed here. Tr.e packaging cell line can be grown to high density, infected with helper 
vims and transferee with the recombinant viral genome. Then, expression of the viral 
proteins encoded by the packaging cells is induced by the addition of dimenz.ng agent to 
allow the production of virus at high titeT. 

In other embodiments, the subiect constructs are used as part of a therapeutic 
25 treatment program from an animal. In one embodiment, the constructs of the invention are 
used to stimulate transcription of an endogenous gene of a subiect. The endogenous gene 
can be any gene of the genome, increased expression of which is beneficial to a subject. 
For example, a subiect may not produce sufficient amounts of a specific protein, due to a 
defect in a protein reguiaung the express.on of the gene encoding the protein. In other 
30 embodiments, expression of a target gene is desired to compensate for the deficiency of 
expression of another gene. In yet other embodiments, the method of the invention is used 
to stimulate expression of an endogenous gene to compensate for a loss of tne protein 
encoded bv the encogenous gene. For example, the number of cells producing a specific 
secreted protein may be reduced in a subiect. e.g. as the result of a disease or condition. 
35 thus resulting in reduced production of the specific factor in the subiect. 

In a preferred embodiment, the method of the invention is used to stimulate 
production of a factor which is necessary for the proliferation and or differentiation of one 
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destroyed ec bv radiotherapy or cnemotherapy . According^. . 

ZZZl of =,ur,opo,e,,„ arc produced in *e subjec, and to, producon „■ red blood 
ZZZZLi «Lk* m diseases or co„d,„on, in which one or m o,= spee.ne e 
rales are desdoved bv rhe disease process, e.g., in nonimmune diseases, the specific cells 
ri ep emshed op sr.muiahog expression of one or more genes encoding fac or 
Zl£ P^f-ion of rhese ceUs. The mediod of the invenoon can a so be 
"e number of lymphocyues ,„ a sub.ec, having AIDS, such » ^ 
expression of lymphokanes. eg. IL-l. which sr.mulares prolderadon of ceri.in T helper 
iTh) cells. 

« leas, one advanuage of increasing ,he producon of » specific prore.n ma 
sub ,ec, bv sumulaimg egression of uhe endogenous gen. encoding * 
„ absence of an — reaction agaurs, dr= proie.n. drus resulting ma more fneien, 
of ,he subiec, Funhermore. for die same reason. ,, ,s des.r.ble ma, rh 
"uonal activaio, or nucleic acd encoding such administered .o a subiec, orig.n. 
fZ Z same spaces as rha, of ,h= subjec, ,o which „ is ,dmin,s,er=d. In pardcumn ,, s 
,0 adm.n.s.er , iranscpt.ona, activator of human origin ,0 a human subjecr 
„ However. uanscnunonal acva.ors having acdv.non orgs which are no, of 

1 also be used according ,o ,he me.hods of ,he ,„v=n„o„. In such condiii.n, „ ma> oe 
,o —as, adminisicr ,0 me subiec, an immunosuppressant drug, c.g 
ovcloaporin A. or other compound which represses immune reactions. 
Immunosuppressive drags are well known in die an. 

Cells which have been modified ex vivo with die DNA constructs of me present 
"" ,„ve»,,o„ mav be urowx. ,„ cui.ure under seleci.ve cond.uon, and cells which are selected 
ng ,he desired consmrcus, mav dren he expanded and furahe, analyzed, us.ng. for 
"dne polvmemse chain re.con for determining ,he presence of the cons.rac, ,n 
2 ho cells undo, aasoys for me producon of ,he desired gen. product,. Once 
: .o liL bos, ..Us has. bu.n idennfi.d. di.y may ton be us.d as placed, e.g. grown ,n 
culture or introduced into a host organism. 

Deoendmc upon ihe na.ure of ,h= cells. ,he cells may be introduced imp a host 
„ B amsm e.g. a'mLmal. in a w,de vane.y of ways Hematopoietic cells may be 
ZL.ared by unecoo in,0 the vascular system, diere being usualiy « leas, 
-.. cells and generallv no, more ihan abou, 1010 ceils. The numoe, o, c.lis which « 
e " Jed w,„ depend unon a number of circumstance, die purpose for .he introduce 
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admiRistrations. the ability of the ceils to multiply, the stability of the therapeutic agent, 
the physiologic need for the therapeutic agent, and the like. Generally, for myoblasts or 
fibroblasts for example, the number of cells will be at least about 104 and not more than 
about 109 and may be applied as a dispersion, generally being injected at or near the site 
5 of interest. The cells will usually be in a physiologically-acceptable medium. 

Cells engineered in accordance with this invention may also be encapsulated, e.g. 
using conventional biocompatible materials and methods, pnor to implantation into the 
host organism or patient for the production of a therapeutic protein. See e.g. Hguyen et a!. 
Tissue Implant Systems and Methods for Sustaining viable High Cell Densities within a 
10 Host. US Patent No. 5,314,471 fBaxter International, Inc.), Uludag and Sefton. 1993. J 
Biomed. Mater. Res. 27(10^:1213-24 (HepG2 cells/hydroxyethyl methacry late- methyl 
methacrylate membranes), Chang et al. 1993. Hum Gene Ther 4(4):433-40 (mouse Ltk- 
ceils expressing hGH'immunoprotective perm-selective alginate microcapsules: Reddy et 
al. 1993. J Infect Dis 168(4 i: 1082-3 (alginate), Tai and Sun, 1993, FASEB J 7(1 1 ): 1061-9 
15 (mouse fibroblasts expressing hGH/alginate-poly-L-lysine-alginate membrane), Ao et al, 
1995, Transplanataion Proc. 27(6):3349, 3350 (alginate): Rajotte et al, 1995, 
Transplantation Proc. 27(6):3389 (alginate); Lakey et al. 1995, Transplantation Proc. 
27(6):3266 (alginate); Korhutt et ai. 1995. Transplantation Proc. 27(6):3212 (alginate). 
Dorian et al. US Patent No. 5.429.821 (alginate); Emerich et al, 1993. Exp Neurol 
20 122(1 ):37-47 ^polymer-encapsulated PC12 cells); Sagen et al. 1993. J Neurosc: 
13(6):2415-23 (bovine chromaffin cells encapsulated in semipermeable polymer 
membrane and implanted into rat spinal subarachnoid space); Aebischer et ai. 1904. Exp 
Neurol 126(2): 15 1-8 ( poiymer-encapsulated rat PC 12 cells implanted into monkeys; see 
also Aebischer, WO 92 19595); Savelkoui et al, 1994. J Immunol Methods I 70(2): i S5-9o 
25 (encapsulated hybndoraas producing antibodies: encapsulated trans fecied cell lines 
expressing various cytokines); Winn et al, 1094. PNAS USA 91(61:2324-8 (engineered 
BHK ceils expressing human nerve growth factor encapsulated in an immunoisoiation 
polymeric device and transplanted into rats), Emerich et al. 1994. Prog 
Neuropsychopharmacol Biol Psychiatry 18(5) 935-46 (polymer-encapsulated PC 12 cells 
30 implanted into ratsi; Kordowcr et al, !0O 4 , PNAS USA 91(23 ):1 0898-9C2 (poiymer- 
encapsulated engineered BHK cells expressing hNGF implanted into monkeys) and Butler 
et al WO 95 04521 (encapsulated device) The cells may then be introduced m 
encapsulated form into an animal host, preferably a mammal and more preferably a human 
subject in need thereof. Preferably the encapsulating material is semipermeable, permitting 
35 release into the host of secreted proteins produced by the encapsulated cells. In many 
embodiments the semipermeable encapsulation renders the encapsulated ceils 
immunologically isolated from the host organism in which the encapsulated cells are 
introduced. In those embodiments the ceils to be encapsulated may express one or more 
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chimeric proteins containing component domains derived from proteins of the host spec.es 
and/or from viral proteins or proteins from spec.es other than the host species. For 
example in such cases the chimeras may contain elements derived from GAL4 and VP 16 
The cells may be derived from one or mere individuals other than the recip.ent and may be 
derived from a species other than that of the recipient organism or patient. 

Instead of ex vivo modification of the cells, in many situations one may wish to 
modify cells in vivo. For this purpose, vanous techniques have been developed for 
modification of target tissue and ceils in vivo A number of viral vectors have been 
developed, such as described above, which allow for transfection and. in some cases, 
integration of the vims into the host. See, for example. Dubensky et al. (198-1) Proc. Natl 
Acad. Sci. USA 81. 7529-7533: Kaneda et al., ( 1989) Science 243.375-378: Hiebert et al. 
U9S9) Proc. Natl. Acad. Sci. USA 86. 3594-3598: Hatzogiu et al. (1990) J. Biol. Chem 
265. 17285-17293 and Fern-, et al. (1991) Proc. Natl. Acad. Sci. USA 88. 8377-8381. The 
vector may be administered by injection, e.g. intravascularly or intramuscular!) . 
5 inhalation, or other parenteral mode. Non- viral delivery methods such as administration of 
the DNA via complexes with liposomes or by injection, catheter or biolistics may also be 
used. 

in accordance w.th in vivo genetic modification, 'he manner of the modification 
will depend on the nature of the tissue, the efficiency of cellular modification required, the 

0 number of opportunities to modify the particular ceils, the accessibility of the tissue to the 
DNA composition to be introduced, and the like. By employing an attenuated or modified 
retrovirus carrying a target transcriptional initiation region, if desired, one can activate the 
virus using one of the subject transcription factor constructs, so that the virus may be 
produced and transfect adjacent celis. 

IS The DNA introduction need not result in integration in every case In some 

situations, transient maintenance of the DNA introduced may be sufficient, in this way. 
one could have a short term effect, where cells could be introduced into the host and then 
turned on after a predetermined time, for example, after the cells have been able to home 
to a particuiar site 

30 In another embodiment of the invention, the transcriptional activator of tne 

invention recognizes a target endogenous gene, in which the promoter and'or one or more 
other regions of the gene has been modified to include a target sequence that is specificalK 
recognized by the DNA binding domain of a known transcription factor and tne 
transcriptional activator contains this DNA binding domain Thus, the target endogenous 

35 gene is modified to be specifically recognized by a desired transcription factor. Such an 
embodiment can be useful in situations in which no DNA binding protein is known to 
specifically bind to a regulatory region of the target gene. Thus, in one embodiment, a cell 
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is obtained from a subject and the ceil is genetically eng.r.eered in vitro to insen a desired 
regulators sequence into the promoter of the target gene. The cell can then be former 
administered to the subject. Alternatively, prior to administration of the ceil to the subject, 
the cell can further be modified to include a nucleic acid encoding a transcriptional 
« activator composing a DNA binding oomain which is capable of interacting specifically 
with the regulatory element introduced into the target gene. In another embodiment, an 
encogenous gene is modified in vivo by, e.g.. homologous recombination. 

Modification of a gene in a cell can be done, e.g.. by homologous recombination, a 
technique well known in the an. and described, e.g., in Thomas and Capecchi (1987, Celi 
,C 51 503; Mansouretal (1988. Nature 336:348; and Joyneretal. (1989) Nature 338:153. 

In another embodiment, the transcriptional activator is used to stimulate 
transcription of an exogenous gene integrated into chromosomal DNA of a subject. An 
exogenous target gene can be introduced into a subject, by obtaining a cell from a subject, 
introducing the tareet eene and optionally a nucleic acid encoding a transcriptional 
, 5 activator into the cell and administering the cell to the subject. This embodiment is useful 
in situations in which in which no DNA binding protein is known to specifically bind to a 
reeulatorv region of the tareet gene or in situations in which the target gene encoces a 
protein which is not naturally produced by a cell. For example, the target gene can be a 
rumor antigen, which is not produced by the subject under normal conditions, out which 
20 one desires to express in the subject as a vaccine antigen to prevent development ot a 
tumor expressing the tumor antigen. 

Exogenous genes can also encode antisense RNA or nbozymes or other RNA 
molecules which are not translated For example, the method of the invention can be used 
to inhibit Deduction of one or more specific proteins in a cell of a subject. The 
25 availability of potent transcriptional activators provided by the invention will ensure that 
high levels of RNA. e.g.. antisense RNA. are produced in a cell 

In a preferred embodiment of the invention, the transcriptional activator is a 
complex composing a first fusion protein having multiple activation units and a iigand 
binding domain, a second fusion protein having a DNA binding domain and a iigano 
30 binding domain, and a iigand which interacts simultaneously with both iigand binding 
domains. Thus, activation of transcription of a target gene is stimulated oniy in tr.e 
presence of the iigand. e.g.. dimertzmg agent. Accordingly, expression of the target gene 
in a subject is stimulated only upon administration of the Iigand to the subject 

The dimenzinc lieand ma> be administered to the patient as desired to activate 
« transcription of the targe: gene Depending upon the binding affin.n of the Iigand. tr.e 
response desired, the manner of administration, the half-life, the number o: cells present. 
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various protocols may be employed. The I.gand may be administered parenteral* or 
orallv. The number of administrations w.ll depend upon the factors described above. The 
Hgand mav be taken orally as a pill, powder, or dispersion: bucally. sublingual*: injected 
intravascular*, intraperitoneally, subcutaneously; by inhaiauon. or the like the hgana 
« (and monomenc antagonist compound', may be formulated using conventional methoas 
and materials well know in the an for the various routes of administration. Tne precise 
dose and particular metnod of administration will depend upon the above factors and be 
determined by the anending physician or human or animal healthcare provider. For the 
most part, the manner of administration will be determined empirically. 
0 In the event that transcriptional activation by the Hgand is to be reversed or 

terminated, a monomenc compound whicn can compete with the dimenzing hgand may be 
administered. Thus, in -die case of an adverse reaction or the desire to terminate the 
therapeutic effect, an antagonist to the dimenzing agent can be administered in any- 
convenient way. particularly intravascular*, if a rapid reversal is desired. Alternatively. 
l< one mav provide for the presence of an inactivation domain tor transcnptional silencer, 
with a DNA binding doma.n. In another approach, cells may be eliminated through 
apoptosis via acnalinc through Fas or TNT receptor as described elsewhere. See 
International Patent Applications PCT/US94/01617 and PCT.US94/08008. 

The particular dosage of the Hgand for any application may be determined in 
-o accordance with the procedures used for therapeutic dosage monitoring, where 
maintenance of a particular level of expression is desired over an extended period of times, 
for example, greater than about two weeks, or where there is repetitive therapy, with 
individual or repeated doses of hgand over short periods of time, with extended intervals, 
for example, two weeks or more. A dose of the hgand within a predetermined range 
;< would be given and monitored for response, so as to obtain a time-expression level 
relationship, as well as observing therapeutic response. Depending on the levels observed 
dunne the time per.od and the therapeutic response, one could prov.de a larger or smaller 
dose me next ume. following the response. This process would be iterative* repeated 
until one obtained a oosage within the therapeutic range Where the Hgand is chronically 
*0 administered, once the maintenance dosage of the I.gand is determined, one could then do 
assays at extended intervals to be assured that the cellular system is providing the 
appropriate response and level of the expression product. 

It should be appreciated that the system is subject to many variables, such as the 
cellular response to the I.gand. the efficiency of expression ana. as a PP ro P r.ate. the level of 
is secretion, the activits of the expression product, the particular need of the patient, which 
mav van, wnth ume and circumstances, the rate of loss of the cellular activity as a result of 
loss of cells or expression activity of individual cells, and the like. Therefore, it is 
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expected that for each individual patient, even if there were universal cells which could be 
administered to the population at large, each patient would be monitored for the proper 
dosage for the individual. 

In one embodiment of the invention, methods of introduction of an hgand. e.g.. 

< dimer.zine agent. and<or a cell modified according to the method of the invention to 
svnthesize hieh levels of protein from a target gene may be provided by recnargable or 
biodegradable devices. Various slow release polymeric devices have been developed and 
tested in vivo in recent vears for the controlled delivery of drugs, including protemac.ous 
biopharmaceuticais. A variety of biocompatible polymers (including hydrogels). 

0 including both biodegradable and non-degradable polymers, can be used to form an 
implant for the sustained release of a d.merizer or a protein produced by a cell modified 
according to the method of the invention at a particular target site. Such embodiments of 
the present invention can be used for the delivery of an exogenous* purified proteir. 
produced accorcme to the method of the invention, which has been incorporated in the 

,5 polymeric device, or for the delivery of a protein produced by a cell encapsulated in the 
polymeric device 

An essential feature of certain embodiments of the implant can be the linear release 
of the dimerizer or protein produced by the encapsulated cell which can be achieved 
through the manipulation of the polymer composition and form. By choice of monomer 
composit.on or polvmer.zation technique, the amount of water, porosity and consequent 
permeabilitv characteristics can be controlled. The selection of the shape, size, polymer, 
and method for imoiantaiion can be determined on an individual basis according to the 
disorder to be treated ar.d the individual patient response. The generation of such implants 
is generally known m the an. See. for example. Concise Encylopedia of Medical & Denta! 
Materials, ed. by David Williams (MIT Press: Cambridge. MA. 1990); and the Sabei ct ai. 
U.S. Patent No. 4.883.666. In another embodiment of an implant, a source of cells, 
modified accordinc to the method of the invention, producing a desired protetn. or a 
solution of hvdogel matrix containing punfed protein or dimenzer. is encapsulated m 
.mplantabie hollow fibers Such fibers can be pre-spun and subsequently loaded with the 
protein source or dimenzer (Aebischer et al U.S. Patent No. 4.892.538. Aebischer et a! 
U.S. Patent No. 5.106.627; Hoffman et al. (1990) Expt. Neurobiol. 1 1 0:39-44; jaeger eta. 
(1990. Prog. Brain Res 82:41-46. and Aebischer et al. (1991) J. Biomech. Eng. 113:178- 
183) or can be co-extruded with a polymer which acts to form a polymeric coat about the 
cell protein or dimenzer (Lim U.S. Paten. No. 4.391.909; Sefion U.S. Patent No 
4 ->3 888; Sugamon et al. 0989) Trans. .Am. Artif. Intern. Organs 35:791-790; Senon et 
al. .1987) B.otennoi. Bioeng. 29:! 135-1 143. and Aebischer et ai. (1991. B.omatenais 
12:50-55). 
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Thus, the method of the invention can broadly be applied to any situation, e.g.. fo: 
treating or preventing any disease or condition, in which transcriptional activation of an 
integrated targe: gene is desired. Depending on the specific embodiment of the invention, 
a trlnscnouona; activator, nucleic ac.d encod.ng such, target exogenous gene, and or 
< ol.comenzer is administered to a subject. These an be administered as such or together 
with a delivery vehicle, e.g., liposomes Whether with or without a delivery vehicle, these 
compounds are preferably administered together with a pharmaceut.caUy acceptable 
earner. Methods of administration of these compounds are known in the an and are 
briefly disclosed below 

,0 Toxicity and therapeutic efficacy of such compounds can be determined by 

standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining the LD ADVANCE \d 1 50 ADVANCE \u 1 (the dose lethal to 50% of the 
population., and the EDADVANCE >.d 1 50 ADVANCE \u 1 (the dose therapeutically 
effective in 50% of the population). The dose ratio between toxic and therapeutic effects 
!5 is the therapeutic index and it can be expressed as the ratio LD ADVANCE \d 1 
50ADVANCE a 1 /EDADVANCE \d 1 50ADVANCE \u 1 . Compounds which exhibit 
large therapeutic .nd.ces are preferred. While compounds that exhibit toxic side effects 
may be used, care should be taken to design a delivery system that targets such compounds 
to the site of affected tissue in order to minimize potential damage to uninfected cells and. 
20 thereby, reduce side effects. 

The data obtained from the cell culture assays and animal stud.es can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the EDADVANCE \d 1 
50 AD VANCE u 1 with little or no toxicity. The dosage may vary within this range 
=5 depend.ng upon the dosage form employed and the route of adm.n.stration utilized. For 
any compound used in the method of the invention, the therapeutically effective dose can 
be estimated initially from cell culture assays. A dose may be formulated in animal 
models to achieve a circuiat.ng plasma concentration range that includes the 
ICADVANCE d 1 50 ADVANCE \u 1 (i.e., the concentration of the test compound which 
30 achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such 
information can be used to more accurately determine useful doses in humans. Levels in 
plasma may be measured, for example, by high performance liquid chromatography. 

Pharmaceutical composiuons lor use m accordance with the present invention may 
be formulated m conventional manner using one or more physiologically acceptable 
35 earners or excipients. Thus, the compounds and their physiologically acceptable salts and 
solvates may be formulated for administration by. for example, injection, inhalation or 
insufflation "(either through the mouth or the nose) or oral, buccal, parenteral or recta: 
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administration. 

For such therapy, the compounds of the invention can be formulated for a variety 
of loads of administration, including systemic and topical or localized administration. 
Techniques and formulations generally may be found in Remmington's Pharmaceutical 
Sciences. Meade Publishing Co.. Easton. PA. For systemic administration, injection is 
preferred, including intramuscular, intravenous, intraperitoneal, and subcutaneous. Fo: 
injection, the compounds of the invention can be formulated in liquid solutions, preferably 
in phvsioloEically compatible buffers such as Hank's solution or Ringer's solution. In 
addition, the compounds may be formulated in solid form and redissolved or suspended 
immediately pnor to use. Lyophilized forms are also included. 

For oral administration, the pharmaceutical compositions may take the form of. for 
example, tablets or capsules prepared by conventional means with pharmaceutical 
acceptable cxcipients such as binding agents (e.g.. pregelatmised maize starch, 
polyvinylpyrrolidone or hydroxypropyl methylcellulose): fillers .e.g. lactose, 
microcrystalline cellulose or calcium hydrogen phospnate): lubricants (e.g.. magnesium 
stearate! talc or silica); aisintegrams (e.g.. potato starch or sodium starch glycolatei; or 
wemng agents (e.g.. sodium lauryi sulphate). The tablets may be coated by methods well 
known in the art. Liquid preparauons for oral administration may take the form of. for 
example, solut.ons. syrups or suspensions, or they may be presented as a dr> product for 
constitution with water or other suitable vehicle before use. Such liquid preparations may 
be prepared by conventional means with pharmaceutically acceptable additives such as 
suspending agents (e.g.. sorbitol syrup, cellulose derivatives or hydrogenated edible fats), 
emulsifying agents v e.g.. lecithin cr acacia); non-aqueous vehicles (e.g., almond oil. oily 
esters, ethyl alcohol or fractionated vegetable oils); and preservatives (e.g.. meiml or 
propyl-p-hydroxvbenzoates or sorbic acid). The preparations may also contain buffer 
salts, flavoring, coloring and sweetening agents as appropriate. 

Preparations for oral administration may be suitably fomiulated to give controlled 
release of the active compound. For buccal administration the compositions may take the 
form of tablets or lozenges formulated in conventional manner. For administration b> 
inhalation, the compounds for use according to the present invention are convenient!;, 
delivered in the form of an aerosol spray presentation from pressurized packs cr a 
nebuliser. with the use of a suitable propellant. e.g.. d.chlorodifluoromethar.e. 
tnchiorofluoromethane. dichiorotctrafluoroethane. carbon dioxide or other suitable gas. In 
the case of a pressurized aerosol the dosage unit may be determined by providing a valve 
to deliver a metered amount. Capsules and cartridges of e.g. gelatin for use in an inhaler 
or insufflator may be formulated containing a powder mix of the compound and a suitable 
powder base such as lactose or starch. 
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The compounds may be formulated for parenteral administration by injection, e.g.. 
nv bolus iniect.on or continuous infusion. Formulations for injection may be presented in 
unit dosaee form. e.g.. in ampoules or in mult-dose containers, war. an adaec 
preservative. The composiuons may take such forms as suspensions, solutions or 
< emuls.ons in o,lv or aaueous vehicles, and may contain formulatory agents such as 
suspending stabilizing and<or dispersing agents. Alternatively, the active ingredient may 
be in powder form for constitution with a suitable vehicle, e.g.. stenle pyrogen-tree water, 
before use. 

The compounds may also be formulated in rectal compositions such as 
,o suppositories or retention enemas, e.g.. containing conventional suppository bases such as 
cocoa buner or other glycendes. 

In addition to ihe formulations described previously, the compounds may also be 
formulated as a aepot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneous* or intramuscularly) or by intramuscular 
,< injection. Thus, for example, the compounds may be formulated with suitable polymeric 
or hvdrophobic materials (for example as an emuls.on in an acceptable oil) or .on 
exchange resins, or as sparingly soluble derivatives, for example, as a sparingly soiuoie 
salt. 

Svstemic administration can also be by transmural or transdermal means. For 
20 transmural or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art. and 
include, for example, for transmural administration bile salts and fus.dic acid 
derivatives. In addition, detergents may be used to facilitate permeation Transmural 
administration may be through nasal sprays or using suppositories. For topical 
25 administration, the oligomers of the invention are formulated into ointments, salves, geis. 
or creams as generally known in the an. 

In clinical settings, the gene delivery systems for the genes encoding 
transcriptional activators and optionally target gene can be introduced into a patient by any 
of a number of methods, each of which is familiar in the an. Fcr instance, a 
30 pharmaceutical preparation of the gene delivery system can be introduced systemically. 
e g bv intravenous injection, and specific transduction of the protein in me target cells 
occurs predominantly from specificity of transection provided by the gene delivery 
vehicle, cell-tvpe or tissue-ty pe expression due to the transcriptional regulator, sequences 
controlling expression of the receptor gene, or a combination thereof In other 
« embodiments, initial delivery of the recombinant gene is more limited with introduction 
into the animal being quite localized. For example, the gene delivery ven.cle can re 
introduced bv catheter i see U.S. Patent 5.328.4701 or by stereotactic injection .e.g. Chen et 
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al. (1994) PNAS 91: 3054-3057). 

The pharmaceutical preparat 10 n of the gene therapy construct can con S1 st 
essentialK of the gene deHvery system in an acceptable diluent, or can comprise a slow 
release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the 
5 complete gene del-very system can be produced intact from recombinant cells, e.g. 
retroviral vectors, the pharmaceutical preparation can comprise one or more cells which 
produce the gene delivery system. 

The compositions may, if desired, be presented in a pack or dispenser device which 
mav contain one or more unit dosage forms containing the active ingredient The pack 
10 may for example comprise metal or plastic foil, such as a blister pack. The pack or 
dispenser device mav be accompanied by instructions for admimstranon. 

The present invention is further illustrated by the following examples which should 
not be construed as limning in any way . The contents of all cited references including 
literature references, issued patents, published patent applications as cited throughout this 
,5 application are hereby expressly incorporated by reference. The practice of the present 
invention will employ, unless otherwise indicated, conventional techniques of cell biology, 
cell culture, molecular biology, transgenic biology, microbiology, recombinant DNA. and 
immunology, which are within the skill of the art. Such techniques are explained fully in 
the literature. See. for example. Molecular Cloning* A Laboratcry+Manual. 2nd Ed., ed. b> 
2C Sambrook. Fritsch and Maniar.s (Cold Spring Harbor Laboratory Press. 1989.; DNA 
Cloning. Volumes i and II ,D N. Glover ed.. 1985): Oligonucleotide Synthesis (M. J. Gait 
ed.. 1984); Mollis et al. U.S. Patent No: 4.683,195; Nucleic Acid Hybridization ,B. D. 
Hames & S. J. Higgins eds. 1984); Transcription And Translation <B. D. Hames A. S 
Higgms eds. 1984,. Culture Of .Animal Cells (R. I. Freshney. Alan R. Liss. Inc.. 1Q87j: 
25 Immobilized Cells And Enzymes (IRL Press. 1986,; B Perbal. A Practical Guide fc 
Molecular Cloning .1984';; 'die treatise. Methods In Enzymology (Academic Press, inc . 
N.Y.): Gene Transfer Vectors For Mammalian. Cells (J. H. Miller and M P. Calos eds.. 
1987. Cold Spring Harbor Laboratory): Methods In Enzymology. Vols. 154 and 155 (Wu 
et al eds.). Immunochemical Methods In Cell And Molecular Bioiog;. ( Mayer and 
-0 Walker, eds.. Academic Press. London. 1987). Handbook Of Experimental Immunology . 
Volumes I-IV (D. M Weir and C. C. Blackwell. eds.. 1986): Manipulating the Mouse 
Embryo. (Cold Spring Harbor Laboratory Press. Cold Spring Harbor. N.Y.. 1986) 
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Exemplification 

The invention now being generally described, it will be more readily understood b> 
reference to the following examples which are induced merely for purposes of illustration 
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cf certain aspects and embodiments of the present invention, and axe not intended to limit 
the invention 

Example 1 : Stimulation of High Level Transcription of an Integrated Gene 

This Example shows that a gene integrated into chromosomal DNA is immune 
from the effect of squelching and that high level transcription of an integrated gene can be 
obtained by delivering multiple activation domains to the gene. 

First, the effect of increasing concentrations of a transcriptional activator on an 
integrated versus a non-.ntegrated reporter gene was compared as follows. Two constructs 
0 encoding chimeric transcriptional activators under the control of the CMV 
enhancer/promoter were used. The first construct (pCG-GV) encodes a chimeric 
transcription factor composed of the yeast GAL4 DNA-binding domain (amino acids 1 -94> 
and activation domain (ammo acids 410-490) derived from the herpes simplex virus 
protein VP 16 (Sadowski. 1.. et ai. (1988) Nature 335:563-4). The second construct (pCG- 
,5 CS) encodes a chimeric transcription factor composed of the yeast GAL4 DNA-binding 
domain (amino acids 1 -94) and the activation domain (amino acids 361-550) from the NF- 
B P 65 protein .Ballard. DAW. et al. (1991) Proc. Natl. Acad. Sci 89:18^5-1879, 
Schm.tz. M.L. and Baeuerle. P.A. (1991) EMBO J. 10:3805-3817). The effect of these 
transcription factors was tested on a target gene composed of a secreted alkaline 
20 phosphatase (SEAP) reporter under the control of a minimal human IL-2 gene promoter 
flanked by five GAL4 binding sites. 

HT1080 cells were transiently transfected with the reporter construct and or a 
construct encoding a clr.menc transcription factor as follows. HT1080 ceils were grown at 
37 C in MEM medium containing 10% fetal calf serum. non-essentiaJ ammo acids and 
- penicillin-streptomvcin. Twenty-four hours before transection, approximately 2X10? 
cells were seeded in each well in a 12-well plate. Cells were transfected using 
Lipofectamine as recommended (Gibco BRIA Cells in each well received the amount of 
plasmids indicated m the figure, with or without 400 ng of reporter plasmid. with the total 
amount of DNA being aa;usted to 1.25 ug with pUC19. Five hours later, the medium was 
30 removed and 1 ml of fresh medium added 1 8-24 hours later. 100 ui medium was removed 
arid assayed for SEAP activity using a Luminescence Spectrometer (Perkm Elmer) at 35C 
nm excitauon and 450 nm emission 

HT1080 cells were stabiy transfected with the reporter construct pLH-5xGAL4- 
IL2-SEAP as follows. The retroviral vector P LH-5xGal4-IL2-SEAP was constructed by 
;-5 cloning the 5xGAL4-IL2-SEAP fragment descr.bed above into the vector pLH ( Rivera. 
Y.M.. et al. (1996) Nature Medicine 2:1028-1032). which also contains the hycromycin B 
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resistance gene driven by the Moloney murine leukemia virus long terminal repeat. 
Helper-free" retrovirus, generated as described (Rivera. V.M.. et al. Nature Medicine 
2.1028-1032 (1996)) was used to infect HT1080 cells. Hundreds of hygromycin B f?00 
Mg/'ml) resistant clones were pooled (HT1080 B pool) and individual clones screened by 
5 transient transferor, wtth P CG-GS. The most responsive clone. HT1080B. was selected 
for further analysis 

When the reporter gene and expression plasmids encoding the transcription factors 
were both introduced transiently into HT1080 human fibrosarcoma cells, peak reporter 
gene expression at relatively low levels of input activator piasmid was observed, whue 
i0 higher levels of activator led to a sharp reduction in reporter gene expression (Fig 1 A) 
This inhibiuon of gene expression in the presence of high levels of activator has been 
widely observed, and the phenomenon has been termed "squelching." On the contrary, 
expression of the integrated reporter gene was not inhibited by levels of activator that 
sharply squelched a transiently transfected reporter (Fig. IB). 
15 To rule out the possibility that the immunity of the integrated gene to squelching 

reflects the effect of a single unusual integration site, this experiment was repeated using a 
pool of several hundred HT1 080-denved clones harboring independent reporter gene 
insertions. As shown in Figure 1C. expression of this population of inserted reporter genes 
was not inhibited by high levels of activator. A second reporter construct in which the 
20 same GAL4-driven IL-2 promoter was fused to a human growth hormone (hGH; reporter 
gene was constructed by replacing the SEAP gene with the hGH gene. This plasm.d was 
cotransfected with tne GAL4-p65 expression piasmid into cells containing an integrated 
SEAP reporter gene, thus allowing both reporter genes to be assayed in the same cell 
population. Transections and reporter gene expression were measured as described 
25 above. For determining expression of the hGH reporter construct. 2-5 ul of medium was 
assayed for hGH protein as recommended (Nichols Diagnostic). 

If. in the presence of a high-copy episomal template. GAL4-p65 titrates GTFs 
necessarv for the activity of this promoter, then one would expect to observe inhibition of 
both the episomal hGH gene and the integrated SEAP gene. However, the results indicate 
30 that, whereas expression of the episomal hGH gene was inhibited at high activator 
concentrations. :he integrated SEAP gene responded identically whether or not the 
episomal gene was present (Fig. 2V Similar results were obtained in parallel experiments 
with GAL4-VP16. This observation indicates that one or more mechanistic steps m tne 
transcnpuon cycle differ between episomal and integrated genes. 



35 



Example 2: Transcription of an Integrated Gene Increases Proportionally to the 
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Number of Activation Domains Delivered to the Gene 

This Example shows that expression of an integrated gene car. be driven to very 
high levels by delivering many potent activation domains to the promoter of the gene. 

A modular strategy was designed to deliver different numbers of activation 
< domains to a single integrated target gene. This strategy was based on the ability ot a 
small-molecule "dimenzer" to recruit activation domains to a DMA-bound receptor 
(Belshaw P.J.. et al. (1996) Prcc. Natl. Acad. Sci 93:4604-4607; Rivera. V.M., et al. 
(1996) Nature Medicine 2:1028-1032: Ho. S.N.. et al. (19%) Nature 382.822-824 , Tne 
basic svstem. which is diagrammed in Fig. 3A. was composed of a GAL4 DNA-binding 
,o domain fused to a single copy of human FKBP12 and a p65 activation domain fused to the 
FRB domain of FRAP (Ho. S.N.. et al. . supra: Sabauni. D.M.. et al. (1994) Cell 78:35- 
43) In the presence of tne natural-product immunosuppressive compound rapamyem. the 
FRB-p65 fusion prote.n is efficiently recruited to the GAL4-FKBP fus.on protein. This 
basic svstem results in the delivery of a maximum of one p65 activation domain per GAL-i 
l< monomer The number of deliverable activation domains was increased in rwo ways: (i » 
bv increasine the number of FKJ3P moieties fused to GAL4. as indicated in Fig. 3B; and 
(ii) through the use of a tetramerization domain derived from the E. coli lac repressor 
(Chakenan. A.E.. et al. (1991 I J Biol Chem 266:1371-4: Alberta. S.. et al. (1993) EN1BO 
j 12: 3227-36. Lewis. M. et al. (1996) Nature 2-1:1247-1254) to deliver "bundles' oi 
,0 four" actuation domains to each FKBP moiety, as shown in Fig. 3C. These different 
configurations allowed recruitment of up to sixteen p65 activation domains to a s.ngle 
GAL4 monomer 

These expression constructs were prepared as follows. pCGNN-G expression 
vector was made by inserting a PCR fragment containing the GAL4 DNA binding domain 
25 (amino acids l-«4) flanked by upstream. Xbal and downstream Spel and BamHI sites into 
Xbal- and BamHI- digested pCGNN (Ricardo Ar.ar. please provide a reference) FKBP12 

codine secuence (amino acids 1-107). described in flanked by upstream Xbal and 

downstream Spel and BamHI sites was inserted between the Spel and BamHI s.tes ot 
pCGNN-G to generate oCGNN-GFl. Plasm.ds pCGN*N-GF2, GF3 and GF~ were made 
30 bv the sequential insertion of FKBPI2 coding sequences into Spel- and BamHI d.gestea 
pCGVN-GFl. 2 ar.d 3 plasmids. respectively. PCGNN-RL was constructed by cloning an 
Xbal-BamHl fraement containing a portion of the E. coli iactose repressor ,am.no acids 
46-360) into Spel- and BamHl-digested pCGNN-R (Rivera. V.M.. et al. supra, to ruse it to 
the carboxv terminus of the FRB domain. The p65 activation domain was tusec to the 
35 carboxv terminus of this chimera by inserting an Xbal-BamHl fragment ,ntc Spel- and 
BamHI digested pCGNN-RL to create pCGNN-RLS 

Various comb.nauons of expression constructs, i.e.. 10 ng of plasmid expressing 
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DNA binding domain fusions and increasing amounts of plasmid expressing po_^ 
activation domain fusions; were transfected into HT1080B cells carrying an integrated 
re?ort er gene. The cells were then treated with 10 nM rapamycin to deliver the activaiion 
domains to the target gene and the amount of SEAP was measured in the supernatant. As 
5 shown in Figure 4B and 4C. the results indicate a roughly linear increase in reporter gene 
exrression as a function of the number of activation domains that are delivered to a single 
GAL4 monomer. In contrast, except for the increment from one to two activation domains 
per GAL4 monomer, increasing the number of activation domains that can be delivered to 
an episomai target gene (introduced by transient transection) failed to enhance the peak 
10 levei of gene expression achieved before squelching occurred (Figures 4B and 4C) 

These results indicate thai expression of the episomai reporter gene can be driven 
only to a fixed level before squelching sets in, effectively establishing a ceiling for 
expression from the episomai gene Because squelching does not impose such a limitation 
on the integrated gene, expression can be substantially augmented by delivering more 
:5 activation domains, indicating that expression of an integrated chromatin-embedded gene 
is limited largely by the number (or strength) of activation domains bound at the promoter. 
Furthermore, the ievei of expression reached by the integrated gene is considerably higher 
than that attained by the episomai gene, especially when considered on a per-tempiate 
basis, as shown in Figure 4C. 

20 

Example 3: Transcriptional Activation Units can Synergize to Activate 
Transcription of an Integrated Target Gene 

This Example shows that specific combinations of transcriptional activation 
domains can result in potent transcriptional activators capable of strong transactivation o: 
25 integrated single copy genes. 

The effect of various constructs encoding GAL4 DNA binding domain fusion 
proteins on transcription of an integrated P LH-5xGal4-IL2-SEAP vector in HT1080 ceils 
(clone HT1080B) was determined. The activation domains fused to the GAL 4 DNA 
binding domain < ammo acids 1-Q4> consisted of ammo acids 18x4 of QUI iGQIIIi. amino 
30 acids 1-92 of p53 tGp53», amino acias 417^90 of VP16 (GYP16), ammo acids 450-550 
of p65 <Gp65i. ammo acids 399-499 of CTF (GCTF), ammo acids 411-508 of SRF 
(GSRF). or ammo acids 263-499 of SP1 (GSP1 V 

Various amounts of these constructs were transiently transfected into the HT10S0B 
cell line having a single integrated copy of the vector p LH-5xGal4-IL2-SEAP and the 
35 level of expression of the reporter gene was measured as described above The results, 
which are represented in Figure 6. indicate that among the various fusion proteins, on;;, 
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GAL4-VP16 and GAL^p65 activate transcription of the integrated single cop> reponer 
gene. 

Next the transaction potential of combinations of transcriptional activation 
domams were tested on a single copy integrated gene. Expression constructs encoding « 
« GAL4 DN-A bindina domain linked to 1. 2. or 4 VP16 activation domains (ammo adds 
41 7.490, or 1 " 3. or 4 p65 activation domains (.ammo acids 450-550) or a p6; activation 
domain fused to a VP16 activation domain. The results are presented in Figure 7 and 
indicate that a multirlictv of P 65 activation domains or VP16 activation domains do not 
provide hiener levels of transcnpuon relauve to a transenpuona! activator containing a 
r s.nele p6>"or VP 1 6 activation domain In contrast, the activation domams of P 6> and 
VP 1 6 are capable of cooperating in transcriptional activation of an integrated target gene. 

In another experiment, the transcriptional potential of combinations of 
transcriptional activation domains of VP16 on a single copy integrated gene was 
determined Expression constructs encoding the following fus.on proteins containing a 
K GAL4 DNA bindme doma.n (amino acids 1-94) were prepared and tested in transient 
transections of the HT1080 cell clone containing a smgle integrated vector P LH-5xGal*- 
IL-SEAP- fusion proteins containing 2. 4. 8. or 12 copies (GV8X2, GV8X4. GV8X8. anc 
GV8XP respecuveiv, of an e.ght amino acid variant of a transactivation domain from 
VP16 havine the ammo acid sequence DFDLDMLG (referred to herein as "V8" peptide,. 

~ i -« i a f nr comes (GVCX1. GVCX2. GVCX3. GVCX-. 

-0 fusion proteins containing 1. -. 4. i. or o copies (u v,^i. fVP -, 
GVCX- GVCX6 respecuveiv, of a transactivation domain from the C-terminus of V : 
corresponding to ammo acids 461-490 (referred to herein as "Vc peptide): and fusion 
protems containing 8 cop.es of V8 and 5 cop.es of Vc. wherein the V8 cop.es arc fusee 
Either to the N-termmus or the C-term.nus of Vc. The results, which are presents m 
Figure 8 panels A and B. indicate cooperative between V8 peptides up to about 8 copies 
0--V8 and up to 5 coo.es of Vc. H.gher numbers ofcop.es of V8 and Vc result in reduce, 
transcriptional act.vuv of the reporter gene. However, as shown in Figure 8. P anel C. a 
combination of 8 copies of V8 fused to the N-term,nus of 5 cop.es of % c. resets >n 
stroneer transcriptional levels than that obtained with either 8 cop.es of V8 or > cop,es of 
,0 Vc This transcrmuonai level >s even h.gher than that obtained with full length X P16. 
Int-restmalv however, a fusion protein containing 8 cop.es of V8 fused to the C-term.nus 
of < copies of Vc does not result in higher transcriptional activity ot tne reporter 
constructs, relauve to 8 cop.es of V8 and 5 co P .es of Vc. Thus, specific arrangements ot 
transcriptional act.vat.on domams are more potent in activating transcnpuon man other 
35 arrangements. 

Thus this example demonstrates that combination of multiple activation domams 
can result in potent transcriptional activators capable of stimulating transcnpuon ot a 



PCTOJS97/15219 

WO 99/10508 

-65 - 

sineie copy integrated gene. 
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Example 4: An Alanine/Proline Rich Domain of p65 is Capable of Strongly 
Transacting a Target Gene, when in the Presence of Another Transaction 
Domain 

This Example demonstrates that a specific domain of g P 65 is incapable to stimulate 
transcnpt.on of a target gene on its own. but can synergize with other transcriptional 
activation domains to stimulate transcription of a target gene. 

Analysis of the p65 amino acid sequence revealed the presence of two 
Alanine/Proline rich reuions in p65. The first domain, i.e., domain I. consists of amino 
acids 376-401 and has the amino acid sequence SALALAPAPPQVLPQAPAPAPAPAMY 
(SEQ ID NO ). The second domain, i.e.. domain II. consists of amino acids 402-427 and 
has the amino acid sequence SALAQAPAPYPVLAPGPPQAVAPPAP (SEQ ID NO...: 
The transcriptional potential of these sequences was investigated as follows Constructs 
encoding fusion proteins containing GAL4 DNA binding domain (amino ac.ds 1-92) and 
the following domains were prepared for use in transient transfection: fusion proteins 
containing 1. 2. or 3 copies of amino acids 361-450 of p65 containing the two above- 
described alanine, proline nch regions and referred to herem as the AP domain: fusion 
proteins containing 1. 2. or 4 copies of the SP1 activation domain; and fusion proteins 
containing an SP1 activation domain fused at its N-terminal or C-terminal region to one or 
two AP domains 

The amount of reporter gene expression obtained upon transfection of these 
constructs into the HT1080B cell line containing an integrated copy of the vector pLH- 
5xGal4-lL2-SEAP is shown in Figures 9A and B. These results indicate that 2 ana 4 
copies of the AP domain does not result in significant transact.vation of the reporter gene, 
in contrast to an increase in transcription proportional to the number of Spl domains added 
(Figure 9A). However, the combination of one or more AP domains with an Spl 
actuation domain synergisticatiy activated transcription of the reporter construct (Figure 
9BV Interestingly, the synergy appears to be stronger when the AP domain .s linked to the 
N-terminus of the Spl activation domain. Thus, a combination of transcriptional 
activation domains results in transcriptional activators which can be much more potent 
than naturally occurring transcription factors. 

Yet other combinations of transacttvation domains were tested for their effect on 
transcnpt.on of an integrated single copy gene. The fusion proteins tested comprised a 
GAL4 DNA binding domain and one or more V8 domains, one or more Yc domains, 
and/or one or more AP aomains. Expression constructs encoding these fusion proteins 
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were transiently transacted into the HT1080 cell line containing an integrated copy of the 
vector P LH-5xGal4-IL2-SEAP. The results are shown in Figures 10A and 10B. These 
results show that the AP domain potentiates the transcriptional activation of various 
combinations of transcriptional activation domains However, the .AP domain has a 
< stroneer potentiatine effect on some activation domains than on others. In particular, as 
shown in Figure 10A. AP synerg.zes with V8, but increases only slightly transcription by 
Vc While not wishing to be bound by any particular theory, we note that one possible 
explanation for this phenomenon is that both the AP domain and Vc interact with the same 
GTF. i.e.. TFI1A. whereas V8 interacts with a different GTF. Thus, it is possible that 
,0 svnergv between different transcriptional activation domains is induced when the 
activation domains mteract with different GTFs. Figure 10A further shows that certain 
combinations of activation domains, such as AP-V8x8-Vcx2 and AP-V8x8-AP-Vcx2. are 
capable of stimulating transcription of the reporter gene to higher levels than VP 16 and 
even P 65. which is one of the most potent naturally occurring transcription factors known, 
i 5 Figure 1 OB shows the results of transfections with yet other fusion proteins having 

various combination of transact, vaung domains. In particular. Figure I OB shows that the 
transcriptional activity of a transcnptional activation domain of p65, corresponding to 
ammo acids 361-550 (containing the AP domain) is further potentiated by the addition of 
one or more other transact! vat ion domains, such as an additional AP domain and/or V8 or 
20 Vc domains. 

In yet another set of transfections. constructs encoding fusion proteins having 
activation domains fused to three copies of the ligand binding domain FKBP, DNA 
binding domains fused to ligand binding domains (FKBP or FRB in a single or mutl.pie 
copies) were used. In these transfections. transection was induced by addition of a 
^ ligand. e.g.. rapamvon. FK1012, AP1510, or other synthetic dimenzer (depending on the 
ligand bmding domain) The results obtained were similar to those obtained with the 
covalent system. 

Thus, this Example shows that potent transcnptional activators can be created b> 
the combination of various transcnptional activation domains and that these transcriptional 
30 activators stimulate transcription of an integrated single copy gene. 
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All of the above-cued references and publications are hereby incorporated by 
reference. 



Equivalents 

Those skilled in the an will recognize, or be able to ascertain using no more than 
routine experimentation, numerous equivalents to the specific polypeptides, nucleic acids, 
methods, assays and reagents described herein. Such equivalents are considered to be 
within the scope of this invention. 
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SEQUENCE LISTING 

.:. INFO? MAT I ON FOP- SEQ ID NC:1: 

i SEQUENCE CHARACTERISTICS: 

; A . LENGTH: 24 44 Dase pairs 
(El TYPE: nucleic acid 
(C: STFANOEDNESS : both 
]0 .; D ) TOPOLOGY: linear 

(ii MOLECULE TYPE: C ON A 

IS i ix FEATURE: 

;A NAME 'KEY: COS 

( B : LOCATION : 3 9 . . 164 9 

20 ;xi. SEQUENCE DESCRIPTION: SEQ 10 NO:i: 

-0A3GC GGGGCCGGGT CCCAGCTGGG CCCGCGGC ATG GAC GAA CTC- TO 



25 



Pro Leu lie ?Ae Pro AiT Git Gin Pre Lys Gin Arc Gly Met Arg Phe 



Met Asp Glu Leu Fne 

1 - 

AAG CAG CGG GGC ATG 



30 -3- -A- AAG TGP 3 AG GGG CGC TOO GCG GGC AGO ATC CCA GGC GAG AGG 
' A~ Tvr Lvs Cvs Glu Gly Arc Se: Ala Gly Se: lie Pro Gly Glu Arc 

AGC ACA GAT AC - ACC AA G ACC CAC CCC ACC ATC AAG ATC AAT GGC TAG 
3< Ser Thr Asp Tr.r Tr.r Lys Tr.r Krs Pro Thr lie lys I.« .sr. o-y .yr 
4C <- 30 

ACA GGA CCA GGG ACA GTG CGC ATC TCC CTG CTC ACC AAG GAC CCT CCT 

Thr Gly Pro Gly Tr.r va. Aro I.e Ser _eu '.a. .. r -ys «.„ ?r- --c 

40 SS 60 

_ „_ - r , CAC ccc CAC GAG CTT GTA GGA AAG GAC TGC CGG GAT GGC 

H-s Arc Pro Hi. Pre H:s Glu leu Vai Giy lys Asp Cys Aro Asp M ly 

4 " _„ --- --3 GCT 3AG CTC TGC CCG GAC CGC TGC ATC CAC ACT TTC CAG 
Tvr gIu Ala G.u leu Cys Pre Asp Arc Cys lie His Ser Phe ^r. 

Q £ 9b * u ^ 

SO AAC CTG GGA ATC CAG TCT GTG AAG AAG CGG GAC CTG GAG CAG GCT A 
lie - ' - ' - " - r ' 

1C 5 



AAC CTG GGA ATC CAG TCT GTG AAG AAG CoG ^. - o ? o ^ ? - 
Asn Leu Gly He 31r. Cys Val Lys Lys Arc Asp u.u . 

110 - 1 - 



AGT CAG CGC ATC CAG ACC AAC AAC AAC CCC TTC CAA GTT CCT ?; „ ^ 

<S Gin Arc He Sir. Thr Asr. Asr Asn Pre Fne Gin Va. :'rc _e 

~ ' 12: — 

GA3 "A, "~G~ GG-- GAC TAC GAC CTG AAT GCT GTG CGG CTC TGC TTC CAG 

Glu Arc G.v As p Tyr Asr Leu Asn Ala Val Arc Leu Cys P,e G.n 

GTG ACA GTG CGG GAC CCA TCA GGC AGG CCC CTC CGC CTG CCG CCT GTC 

Va: Thr Val Arc Asr, Pro Ser Gly Arc Pro Leu „rc -eu Pre ,.c 

ice is? 

65 



Leu Ser His Pre lie Phe As"p Asr Arc Ala Pro Asn Thr Ala Glu Leu 

, «, . ; 7 5 » s . 

-0 AAG ATC TGC CGA GTG AAC CCA AAC TCT GGC AGC TGC CTC GGT GGG GAT 

Lys He Cys Arc Val Asr. Arc Asn Ser Gly Ser Cys -eu oiy ^ ~s K - 
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190 19- 



_ — A — 3 7GT GAC AAG GTG GAG AAA GAG GAG ATT GAG GTC 

r~ Leu Leu Cys Asp lys Val Gin Lys Glu Asp He G.u Val 

5 2c: 205 — 

^ - . — r ~- GGG TGG GAG G33 GGA GGG TGG TTT TGG 3AA 33T 

Pr.e Tr.r G?y Pre cly Trp Glu Ala Ar 5 Gly Ser Phe Ser Gin ,1a 



-If '2C 22 = 



10 



GAT GTG CAC GGA CAA GTG GGC ATT GTG TTG GGG AGC GCT GGG TAG GGA 

13C =25 
15 GAG GGG AGG GTG GAG GGT GGT GTG GGT GTG TGG ATG GAG GTG GGG GGG 



S k2: j£ sc; ;;r e; :i. Ph . ^ ^ *™ ?r = r y . m ? 

... TGG ATG GAG GTG GGG GGG 
Ts'p Pre It: Sir. Ala Cr: Val Arg Val Ser Met Gin Leu Ar, Ar; 

3AC C3G — CTG AGT GAG CCC ATG GAA TTC GAG TAG GTG GGA 

20 Pre sir Asc Arc 31, Leu Ser Glu Pre Met Glu Phe Glr. Tyr Leu Pre 

I 7 Vj *- ' - 

— A -A GAC GAT ~3T GAG GGG * TT GAG GAG AAA CGT AAA AGG AGA TAT 
t% Tr.r Asc As p Arc K,s Arc I.e G-. u 31 - Lys Arg Lys Ar, Thr Tyr 

25 2sc 

GAG ACC TTC AAG AGG ATG ATG AAG AAG AGT GGT TTC AGG GGA CCC ACC 

gTu Thr Phe Lys Ser l.e Me, Lys Lys Ser Pre Phe Ser Gly Pre Thr 

: cc 3C: J05 

30 ,- AC CCC GGG GGT CCA CGT GGA CGC ATT GGT GTG CGT TGG CGC AGC TCA 

ifz Pre Arc ?rc Pro Pre Arc Arc -e A., Val Pro Ser Ar, ,er Ser 

310 3IS — 2 - 

_ GTC ccc AAG GGA GGA GCG GAG GGG TAT GCG TTT AGG TCA TCG 

Ilk slz val Pre Lys Pre Ala Pro Gin Pre Tyr Pro Phe Thr ,er ,er 



GAC CCC GGG 



3 3< 



TG AGG ACG ATG AAG TAT GAT 3 AG TTT GCG ACG ATG GTG 



GC 



40 Lee Ser ?hr Tie Asr. Ty'r Asp Glu Pne Pre Thr Me, Val Phe Pro Ser 

GGG GAG ATG AGC GAG GGG TGG 3GC TTG GGC CCG CCC GGT GGC CAA GTC 

G - He Ser Glr. A. a Ser Ala Leu Ala Pro Ala Pro Pro G.n Val 

45 ~" 36C 3-: 

„ — A 3 — CCT GCG GGT GCT CCA GCG ATG GTA TCA GGT 

: e> = -c Gin Ala ?r = Ala Pre Ala Fro Ala Pre Ala Me, Val Ser A. a 

g 75 280 3e5 

50 — - GC- CAG GCG CCA GCG CCT GTG GGA GTG CTA GGC GGA GGC GCT CCT 

Leu All Glr. Ala Pre A. a Pro Val Pro Val Leu Ala Pro Gly Pro Pre 

390 3 9: -0 

5 < -A- — G~G GC! CCA GCT GCP GGC AAG GGC ACG 3 AG GC~ GGG GAA GGA 

Gin La Val A. a Fr: Pro A, a Pro lys Pre Thr Glr A. a Gly G ; u o.y 

i : ■: 4 1 : 



. GAG GC:: — 3 CTG GAG CTG 3AG TTT GAT GAT GAA GAC --o 

60 Thr leu Ser 3iu Ala leu Leu Glr. Leu Glr. Pne Asp Asp G.u Asp Leu 
. 2 < 3 - "* J - 



TGG GC~ TTG 3TT GG3 AA3 AGG ACA GAC CCA GCT GTG ~TC AGA GAC CT3 

Gly All Leu Le- Gly Asr. Ser Tnr Asp Pre A.a Val Pre Thr Asp -eu 

65 4 40 

~- A g — 3AC AA3 TGG GAG TTT SAG GAG GTG CTG AAC CAG GGG ATA 

Va Se- Val Asr Asr. Ser Glu Pne Gin 31n Leu Leu Asr. G.n G.y - ie 

455 <6r 



CO 



:3T 37 G GCG CCS CAC AGA ACT GAG 333 ATG CTG ATG GAG TAG CCT GAC 
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p-o "al Ala Pre his Thr TM Glu Fr: Met Leu Met Glu Tyr Pre 



,- : 475 433 485 



r ACT CGZ CTA GTG ACA GGG GAG AGG GGG GGG GAG GGA GGT GGT 

5 Ala lie Thr Arc; leu Val Thr Ala Gin Arg Pro Pro Aso Pro A.a Pre 



4 9 



lie Fr: Gl; 
05 21 



Lva He As r. Glv Tyr Thr Gly Pre Sly Thr Val Are lie Ser leu V 3 . 

" 5C 55 

-0 TOr Lys Asp Pre Fre his Are Pre hi. Pre His Glu Leu Val Gly Lys 

65 ^: r 



154 1 



155 =• 



— CCA G-G GGG GGG GCG GGG CTC CGI AAT GGC CTG GTT TGA GGA GAT 

Xla P^o leu Sly Ala Pro Gly Leu Pro Asr, Gly Leu Leu Ser G.y Asc 
10 *C = 510 

-AA GAC ~*""C TCG TCG ATT GGG GAG ATG GAG TTG TCA GCG CTG GTG ACT l€2' 

G~ ls P Ph. Ser Ser lie A.a Asp Met Asp Fn. Ser Ala Leu ,e. Ser 

52C 52o =>-^ 

15 CAG ATG AGG TCC TAAGGGGGTG AGGCCTGCCC TGGCCAGAGG ACTGGTTGGA 16- 
Glr. lie Ser Ser 
525 

20 GGGGATTGAA GCCCTCGAAA AGCACTT ACS GATTCTGGTG GGGTGTGTTC CAACTGCCCC 1^9 
CAACTTTGTG GATGT GTT CC TTGGAGGGGG GAGGCATATT TTATTCTTTT ATTGTCAGTA 180 9 

„ CT AT CT 1 TCT GT GTTTT TGGAGGTGGT TAAGCASAAG CATTAACTTG TGTGGAAAGG 

" 33GGAGGTGG GG AAA. GTG AA AGTTTTCCG G TGTCCTGATG GTCAGCTCCG TTCTGTGTAG Irl 
GGAAGTGTG G GGTGGGGGAT CCGGATCCTC CAGCTTCTGG TACTCT GGTA GAGACAGAAG 

-0 GAGGCTGGAG GTAAGGGGTT TGAGGGGAGA AAGCCTTATG AAGTGTCTTC CATCATGGAT 
TGATTACAGG TTAATGAAAA TAACGCCCCA GATACCAGCG GCTGTATGGG ACTGGCATTG 
TGGGTGTGGG TAAGAGGAGG GTTTGAGGGS CTGGCTTCCT GCCCTACAGA GGTCTCTGCC 

" GGCTCTTTGG TTGGTG AA.GG ATGGGTG AAG GAAACAGTGG AACAGCACTG GCTCTCTCCA ZZZ~- 
GGATCGAGAA GGGGTTTGGT GTGGAGTT GG TTGCTCT GGG CTCTTCTGAA GTGGCTTAAT 

_ , ~~AGGCTGC~ AG CT CTC CAG TCAGGAGGCA 

40 AGTAGGGT AA GT . o . . . ^« 1 



1S69 



2 0 4 c 

Z 1 C r ■ 

z i 



::e9 
2 24 c 



AG T GAA G A_AT GA. AAG GACTTGG ACTCTTGCTC TTTCT AGTCT GAACTAATAA 2,1, 
AGGTGTTGGG AAG GTG GAG G 3GAGGAGCTC GTGCC 2AA " 

45 

C IN FORMAT Z ON FTP SE1 ZZ NO:2: 

SECL'ENGE GhARAGTEF.I STICS : 

LENGTH: 5 2" arairtc aoidf 
( 5 TYPE : a-i r.o ac:c 
(C TOPOLOGY: linear 

MOLE GC LE TYPE, protein 

xl : 3e:gen:p gesc?.:ft:on : sec. :dnc;2: 

Met Aso Glu Leu Foe Frz leu l.e Pr.e Pro Ala Glu Glr. Pre Lys Glo 
r i L ~ 

60 Are Giv Met Are Foe Are Tyr Lys Cys Glu Gly Are Ser Ala Gly Ser 

2 z 2 - J " 

Glu Are Ser Tor Asp Thr TOr Lys Tor His Pre Tor He 
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^ sr Cvs Ara Asc Gly ?ne Tyr Glu Ala Glu Leu Cys Pre Asp Arc Cys 

55 ^ 

« lie riis 5er ?ne Sir. Asn Leu Sly lie Glr. Cys Val Lys lys Arc As; 

::; 1-? 

Leu Glu Gin Ala lie Ser Gin Arg lie Gin Tnr Asn Asr. Asn Pre F.ne 
11: — C 

: ° - - - val ?r: lie SI- Glu Gin Arc Hy Asp Tyr Asp Leu Asn Ala Val 
:3C 135 140 

Arc Leu Cvs Pr.e Glr. val Tr.r Val Arc Asp Pro Ser Gly Arq Pro Leu 

A~c Leu Pre Pre Val Leu Ser Kis Pre He Phe Asp Asn *rq Ala Pre 
16 5 1~C * 3 

-0 Asn Tr.r Ala 31- Leu Lys lie Cys Arc Val Asr, Arq Asn Ser Gly Ser 

1EC 195 - - L 

Cvs Leu Gly Gly As, Glu He Pne Leu Leu Cys Asp Lys Val Gin Lys 
195 

~- A<c - e Giu Val Tvr Pr.e Tr.r Giy Pro Gly Trp Glu Ala Arc Gly 

21C — ' 220 

Se - rr.e Ser G.r. A-« Asp Val His Arc Gin Val Ala lie Val ?he Arc 

30 225 22: 235 

Thr Pro Pre Tyr Ala Asp Pre Ser Leu Gin Ala Pro Val Arq Val Ser 
14 5 2 5 C ^ - - 

3< Me' Gin Leu Arc Arc Pre Ser Asp Arc Glu Leu Ser Giu Pre Me: Giu 

2 6. - * - 2 ' C 

r> he Gin Tvr Leu Pro Asp Tr.r Asp Asp Arc His Arc lie Glu Glu Lys 
2" 5 : ~C 285 



40 



Arq Lys Arq Tr.r Tvr Glu ^ 3Q0 



Pr.e Lys Ser He Met Lys Lys Ser Pro 



-he Se- Glv Pre Tr.r Asr ? ro Arc Prr Pre Pro Arc Arc lie Ala Val 

45 225 2: - 

Prc Ser Arc Ser Ser Ala Ser Val Frc Lys Pro Ala Pro Gin Pro Tyr 

ill 33 0 u j5 

SO Pro Phe Thr Ser ier Leu Ser Thr He Asn Tyr As P Glu Phe Pro Thr 

34, 34 5- 2bl 

Me' Vai Phe Pro Ser Glv Sir. He Ser Gin Ala Ser Ala Leu Ala Pro 

355 3€C 365 

AH Gin val leu Prr Sir Ala Pro Ala Frc Ala ?:: Ala Pre 

2-0 :"s 360 

Ala Met Val Ser AH Leu Ala Glr. AH Pro Ala Pro Val Pro Vai Leu 

60 365 295 -C 

Ala Pro Gly ?r: He Sin Ala Val Ala Pro Pro Ala Pro Lys Pro Thr 



4 



65 Glr; Ala Giy Glu Gly Thr Leu Ser Giu AH Leu Leu Glr. Leu Gin Phe 

a:: 425 

Asn Asc Giu Asn Leu Gly Ala Leu Leu Gly Asn Ser Thr Asp Pro Ala 

" 44C 445 

Val Phe Thr Asr Leu Ala Ser a 1 Asp Asn Ser Glu Phe Gin Hr. _eu 
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Leu ash 31, Gly :-e Prr V,: Ala Frc His Tr.r Tr.r 3- ?== «e-. -eu 

46i 4 - 

5 Me t Giu Tyr Pre -• - - - 55 ~« Tr.r ~- ^ V *- Ala ^ 

4 5 i 4 3 ~ 

Fro Asp ?:c Ala Pre A_a Pre Leu G 1 > A. 3 Pre 3_> Leu ?:: Asr Hy 
Le; Leu Ser Gly Asp Glu Asp Pr.e Se: Ser He Ala Asp Met Asp Pr.e 
' s Ala Leu Leu Ser Sir. He Ser Ser 
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Claims: 



1 A nuclei acid encoding a chimeric transcription activator protein which: 

, a i activates transcription of a gene to which the chimer.c transcription 

« activator protein is targeted, ana. 

(b) contains at least one composite transcription activation domain comprising 
a continuous polypeptide region containing two or more component polypeptide 
regions, at least two of which are mutually heterologous and do not occur in nature 
inthe same gene product, and at least one additional domain comprising a DNA- 

10 binding domain. 



:o 



2 A nucleic acid encoding a chimeric transcription activator protein which: 

I a ) actn ates transcription of a gene to which the chimer.c transcription 
activator protein is targeted: and, 
, 5 (b) contains at least one composite transcription activation domain comprising 

a continuous polvpepnde region containing three or more component polypeptide 
regions, at least two of which are mutually heterologous, and at least one additional 
domain which is heterologous with respect to at least one of the component 
polypeptide regions of the composite transcnpnon activation domain. 

7 . A nucle.c acid of claim 2 which encodes a chimeric transcnpnon act.vator protein 
which further contains at least one heterologous domain which binds to a cell oermeant 
ligand. 

4. A nucleic acid of claim : which encodes a chimeric transcnption actuator protein 
25 which further contains at least on heterologous domain which comprises at leas: one DNA- 

binding domain 

5. A nucleic acid encoding a chimeric transcnption activator protein wnicn. 

la) activates transcnption of a gene to which the chimeric transcnpnon 
activator protein is targeted, and, 
30 (b ) contains at least one composite transcription activation domain compnsmg 

a continuous polypeptide region containing wo or more component polypeptide 
regions, at least rwo of which are mutually heterologous, and at least one additional 
domain which is heterologous with respect to at least one of the component 
polypeptide reg.ons of the composite transcription activation domain, wherein at 



PCT/US9-V15219 

WO 99/10508 

- 74 - 

least one of the heterologous domains is derived from an immunophilin. 
cyclophilm. calcineunn, FRAP or DNA g>Tase or from a receptor for tetracvciine 
or ecdysone or another steroid. 



A nucleic acid encoding a chimeric transcription activator protein which 

(a) activates transcription of a gene to which the chimeric transcription 
activator protein is targeted; and, 

(b) contains at least one composite transcription activation domain comprising 
a continuous polypeptide region containing two or more component polypeptide 
regions, a: least two of which are mutually heterologous, a bundling domain, and at 
least one additional domain which is heterologous with respect to at least one of 
the component polypeptide regions of the composite transcription activation . 
region. 



10 



15 



7. A nucleic acid composition comprising a first nucleic acid encoding a chimeric 
transcription activator protein of any of claims 3 or 5 (or a Ibd-dependent claim of 2 or 6) 
and a second nucleic acid encoding a chimeric DNA-bindmg protein comprising at least 
one ligand binding domain for a cell permeant ligand and at least one DNA-bindmg 
domain. 

o 8. A nucleic acid composition of claim 7 in which the chimeric DNA-binding protein 
comprises two or more ligand-bindmg domains. 

9. A nucleic acid composition of claim 7 or 8 which further comprises a target gene 
construct comprising a target gene operatively linked to an expression control sequence 
which includes a DNA sequence to the the chimeric DNA-binding protein binds 

25 10. A nucleic acid composition comprising a first nucleic acid encoding a chimeric 
transcription activator protein of any of claims 1 or 4 (or a DBD-dependent claim of 2 or 
6) and a target gene construe: comprising a targe: gene operatively linked to an expression 
control sequence which includes a DNA sequence to the the chimeric iranscnption 
activator protein binds. 

30 11. A host cell comprising a nucleic acid of any of claims 1 - 6. 

12. A host cell comprising a nucleic acid composition of any of claims 7 - 10. 

13. A host cell of claim 1 1 or 1 2 which contains a genomically-integrated targe: gene 

14. A method for producing a genetically engineered host cell, which method 
comprises introducing a nucleic acid of any of claims 1 - 6 into a host cell under 
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conditions permitting uptake by the cell of nucleic acids. 

1 5 Amethod for producing a genetically engineered host cell, which method 
comprises introducing a nucleic acid composit.on of any of claims 7-10 into a nosi cell 
under conditions permitting uptake by the cell of nucleic acids. 
,16 A method for effecting expression of a target gene which comprises maintaining 
geneticallly engineered host cells of claims 11 . 1 2 or 1 3 under conditions suitable for gene 
expression. 

] 7 A method of claim 1 6 in which the target gene expression is regulated, and the 
conditions for gene expression include the presence of a ligand which binds to one or more 
10 of the chimeric proteins 
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